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ABBREVIATIONS 

IVS  in  vitro  selection 

bya  billion  years  ago 

SL  Size  of  initial  library 

RF  Reactive  Fraction 

NRF  Non-reactive  Fraction:  NRF  =  SL*(\  -  RF) 

k„  Intrinsic  rate  of  molecule  "«":  each  member  of  the  initial  library,  n,  is 

assigned  an  intrinsic  rate,  k„,  which  relates  to  its  ability  to  be  cleaved 
under  selection  conditions.  The  intrinsic  rate  is  sequence-dependent  and 
therefore  unchanging. 

'Alifen  The  intrinsic  rate  of  molecule  "«"  (k„)  expressed  in  terms  of  half-life: 

'Alifen  =  ln2  /  k„*siowest  The  rate  constant  for  the  slowest  catalysts  in  the 
library. 

k/asiest  The  rate  constant  of  the  fastest  molecule  in  the  library. 

St  Selection  time:  the  amount  of  time  allowed  for  a  molecule  to  cleave  itself 

and  therefore  achieve  "survival." 

m  or  b  Distribution  function  for  reactive  molecules.  Two  functions  have  been 

tested  thus  far,  a  linear  function  with  slope  =  m,  and  an  exponential 
function  with  decay  rate  =  b. 

L  Leakage:  the  fraction  of  the  pool  that  "survives"  a  round  of  selection 

independent  of  St  and  intrinsic  rate  constant. 
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caged-ribose  a  ribose-adenosine  with  the  2 'OH  replaced  with  an  ortho-nitrobenzyl 

protecting  group 

r-primer  "ang  +  ribose"  49-nucleotide  substrate  containing  ribose 

r-X  oligonucleotide  "X"  with  ribose-adenosine 

d-X  oligonucleotide  "X"  with  deoxy-adenosine 

A  Deoxyadenosine  triphosphate 

G  Deoxyguanosine  triphosphate 

C  Deoxycytidine  triphosphate 

T  Deoxythymidine  triphosphate 

E-base  5-(3-aminoallyl)-2'deoxyuridine  triphosphate  (see  structure  below) 
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kcat(uni)  chemical  step  for  unimolecular  catalysis 

ki(unj)  rate  of  folding  for  unimolecular  catalysis 

k_i(Uni)  rate  of  unfolding  for  unimolecular  catalysis 

kcat(bi)  chemical  step  for  bimolecular  catalysis 

k  i(bi)  rate  of  association  for  bimolecular  catalysis 

k-i(bi)  rate  of  disassociation  for  bimolecular  catalysis 
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Our  current  understanding  of  the  origins  of  life  suggests  that  catalytic  nucleic 
acids  must  arise  from  random  polymers  of  nucleic  acids.  Searching  random  libraries  of 
nucleic  acid  polymers  has  not  produced  the  abundance  and  power  of  catalysis  believed 
necessary  to  spawn  life.  Further,  the  addition  of  chemical  functionality  to  random 
libraries  has  failed  to  improve  the  catalytic  potential  of  random  libraries  as  anticipated. 

This  apparent  discrepancy  between  observation  and  expectation  may  be 
attributable  to  several  shortcomings  of  the  experimental  design.  Active  catalysts  may  be 
lost  during  the  preparation  of  libraries,  or  during  the  enrichment  for  catalysts. 
Furthermore,  only  incomplete  descriptions  have  been  attempted  for  any,  focusing  only  on 
the  fastest  catalysts  in  the  library,  rather  than  capturing  completely  the  distribution  of 
catalytic  power.  It  is  conceivable  that  added  functionality,  or  alterations  of  the  library  or 
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selection  conditions,  can  dramatically  alter  the  abundance  and  distribution  of  catalysts 
without  altering  the  rate  of  the  fastest  catalyst. 

The  goal  of  this  thesis  is  to  develop  methods  for  estimating  the  distribution  of 
catalysts  within  a  random  library  of  DNA  sequence.  To  aid  in  the  development  of 
realistic  models,  we  began  with  an  in  depth  analysis  of  the  catalytic  behavior  of  an 
individual  DNAzyme,  614.  This  analysis  revealed  a  number  of  surprising  features  of  this 
DNAzyme  that  may  be  generalized  to  other  DNAzymes  created  through  in  vitro 
Selection  (IVS).  Although  selected  with  a  protocol  believed  to  enrich  for  Mg-dependent 
self-cleaving  catalysts,  614  cleaves  both  in  cis  and  trans  independently  of  Mg^.  The  rate 
of  /rans-cleavage  at  substrate  saturation  is  6-fold  higher  than  the  rate  of  m-cleavage. 
Further,  both  cis-  and  trans-cleavage  rates  are  enhanced  at  temperatures  below  25°C  (the 
temperature  at  which  the  selections  were  performed),  suggesting  that  a  balance  is  reached 
between  a  slower  chemical  step  (k^uni)  or  k^bo)  and  a  more  stabilized  association  step 
(ki(uni)  or  ki(bi))  at  lower  temperatures. 

Mutagenesis  of  614  and  its  substrates  was  performed  to  learn  about  the  structure 
of  614  cleaving  in  cis  and  trans.  Compensatory  mutations  revealed  that  one  of  the 
intermolecular  helixes  formed  between  614  and  its  substrate  is  identical  to  an 
intramolecular  helix  formed  in  ds-acting  614.  The  /raws-folded  614,  however,  has  an 
additional  4  base-pair  helix  not  found  in  cz's-folded  614,  and  this  may  partially  explain  the 
faster  rate  of  trans-cleavage  at  saturation. 

We  also  addressed  the  potential  loss  of  catalysts  by  "premature  catalysis"  during 
the  preparation  of  DNAzyme  pools  in  the  IVS.    Purification  of  single-stranded 
DNAzymes  was  accomplished  using  asymmetric  PCR  or  exonuclease  degradation, 


eliminating  the  need  for  column-based  methods  for  performing  IVS.  We  also  examined 
the  utility  of  a  protected  ribose  (caged-ribose)  for  both  kinetic  analysis  and  an  IVS. 
Selections  with  caged-ribose  failed  to  enrich  for  catalysts,  whereas  a  selection  with 
unprotected-ribose  produce  a  number  of  highly  active  catalysts.  Simulations  revealed 
that  the  slight  increase  in  background  cleavage  caused  by  laser  photolysis  explains  the 
failure  of  the  caged-ribose  to  enrich  for  catalysts. 

Analysis  of  the  catalysts  isolated  from  this  in  vitro  selection  differed  from 
sequences  isolated  in  previous  selections  by  only  a  few  nucleotides,  but  their  catalytic 
rates  varied  by  over  an  order  of  magnitude.  This  variation  suggests  that  the  sequence 
space  near  any  particular  active  catalyst  is  well-populated  with  other  active  catalysts. 

Simulations  of  in  vitro  selections  examined  the  impact  of  various  parameters  on 
the  outcome  of  selections.  These  simulations  demonstrated  that  when  the  selection  time 
is  similar  to  the  half  life  of  the  fastest  catalysts  in  the  initial  library,  only  the  fastest 
molecules  are  isolated.  Increasing  selection  time  such  that  it  is  significantly  greater  than 
the  half  life  of  the  fastest  catalysts  in  the  initial  population  creates  a  stabilized  population. 
The  proportionality  between  the  fastest  and  slower  catalysts  of  this  stabilized  population 
remains  similar  to  the  initial  population,  thus  allowing  direct  estimation  of  the  initial 
population  distribution.  The  model  developed  in  this  simulation  can  also  be  used  to 
estimate  the  distribution  of  catalysts  in  the  initial  library  for  future  in  vitro  selections. 
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CHAPTER  1 
INTRODUCTION 


Requirements  of  a  "Living"  System 
In  1994,  a  committee  empaneled  by  NASA  defined  life  as  "a  self-sustaining 
chemical  system  capable  of  undergoing  Darwinian  evolution"  (Joyce,  1994).  "Non- 
living" entities,  such  as  natural  phenomena  like  fire  or  crystal  formation,  often  have 
physical  properties  commonly  associated  with  life-forms  such  as  "energy  consumption," 
"ordered  structures,"  and  "growth  and  replication."  A  living  system  is  distinguished  from 
non-living  on  the  basis  that  living  systems  can  evolve.  The  properties  of  nonliving 
phenomena  such  as  fire  or  crystal  formation  follow  directly  from  physical  laws  and  the 
particular  environmental  conditions  of  the  system.  As  physical  laws  presumably  do  not 
change,  the  behavior  of  nonliving  systems  cannot  change.  An  evolving  system  has 
inherent  physical  properties  as  well,  but  in  contrast  to  nonliving  entities,  the  specific 
physical  properties  of  the  entire  system  are  derived  from  information  contained  within 
the  system  and  can  change  without  destroying  the  ability  to  evolve.  These  physical 
properties  are  either  intrinsic  to  the  information-containing  molecule  or  generated  from 
the  information). 

As  all  life  requires  obtaining  precursors  and  energy  from  an  environment,  the 
requirement  that  life  be  self-sustaining  is  always  relative  to  a  particular  environment. 
The  "NASA  definition"  also  limits  life  to  chemical  systems,  revealing  a  bias  towards  life 
as  we  know  it  and  in  which  we  are  most  interested.  Evolving  systems  such  as  memes 
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(cultural  ideas)  and  in  silico  evolving  systems  (software)  are  not  formally  recognized  as 
living  because  they  are  not  chemical.  The  information-storing  structure  for  memes  is 
poorly  understood,  and  this  may  contribute  to  the  biases  against  considering  this  evolving 
system  a  life-form.  For  in  silico  evolving  systems,  the  range  of  physical  properties 
("function  space")  is  arbitrarily  defined  by  the  programmer  and  therefore  not  open-ended 
like  a  chemical  system.  It  can  be  argued  that  the  "function  space"  for  a  chemical 
evolving  system  is  not  in  fact  open-ended  but  rather  limited  by  physical  laws. 
Ultimately,  both  memes  and  in  silico  evolving  systems  can  be  considered  the  product  of 
and  therefore  not  independent  of,  a  living  system.  One  may,  however,  conceive  of  life  as 
any  system  capable  of  Darwinian  evolution.  The  "NASA  definition"  of  life  might  then 
be  generalized  to  "any  informational-containing  system  that  undergoes  Darwinian 
evolution." 

Although  the  second  law  of  thermodynamics  dictates  that  the  universe  progress  to 
a  state  of  greater  disorder,  this  does  not  necessarily  result  in  the  loss  of  all  information. 
Within  a  system,  order  (information)  can  be  created  and  maintained  if  it  is  matched  by 
the  creation  of  greater  disorder  elsewhere  in  the  universe.  Nonliving  phenomena  often 
result  in  ordered  structures,  such  as  ripples  in  the  sand  of  a  tidal  basin,  the  lattice  structure 
of  a  crystal,  or  polymer  formation.  This  decrease  in  entropy  within  the  system  is  coupled 
to  free  energy  consumption.  In  order  to  persist  indefinitely,  an  informational  system 
must  prevent  information  decay  by  coupling  the  maintenance  of  ordered  structure  to  the 
release  of  available  energy. 

Two  such  mechanisms  for  preserving  encoded  information  are  easily  imagined: 
repairing  the  information  (requiring  reference  to  a  "correct"  version),  or  duplicating  the 


information  (requiring  duplication  efficiency/accuracy  be  greater  than  the  rate  of 
degradation). 

When  the  physical  properties  of  a  system  are  determined  by  the  information  it 
contains,  it  follows  that  the  physical  properties  of  the  system  can  change  according  to 
changes  in  the  encoded  information.  Such  a  system  may  never  settle  into  equilibrium  (as 
physical  systems  are  expected  to  eventually  reach).  Instead,  imperfect  information 
duplication  and  repair  changes  the  physical  properties  of  a  system.  This  eventually 
results  in  multiple  variants  of  an  information-containing  system,  each  with  variable 
physical  properties.  When  two  such  systems  utilize  the  same  resources  (energy  rich 
molecules,  elemental  precursors,  etc)  to  prevent  information  decay,  competition  exists 
among  systems  for  more  efficient  use  of  the  resource. 

Despite  competition  among  systems  for  common  resources,  the  ultimate  survival 
pressure  is  against  the  progression  towards  information  loss.  A  system  can  therefore 
persist  by  either  out-competing  rivals  for  a  common  resource,  or  by  occupying  a  new 
niche  (utilizing  a  different  resource).  Given  the  constant  progressive  force  towards 
information  decay,  there  exists  selection  and  enrichment  for  systems  that  enhance  the 
survival  of  their  information.  As  multiple  systems  compete  for  resources,  information 
systems  linked  to  replication-enhancing  functions  will  predominate. 

From  this  one  can  see  that  Darwinian  evolution  (which  is  the  fundamental  trait  of 
life)  is  the  direct  consequence  of  physical  laws  applied  to  an  information-containing 
system.  For  this  reason,  some  have  argued  that  once  a  minimal  system  (hypercycle) 
capable  of  Darwinian  evolution  exists,  it  will  generate  higher  orders  of  complexity 


spontaneously  and  inevitably  as  long  as  minimal  resources  are  available  and  a  major 
global  environmental  change  does  not  bring  about  extinction  (Kaufman,  1993). 

Two  specific  functional  attributes  follow  from  this  definition  of  life,  the  ability  to 
carry  heritable  information  (genetics)  which  is  linked  to  selectable  traits  (function). 
Selection  is  thus  acting  on  two  levels  simultaneously,  the  system  of  storing  information 
and  the  physical  properties  derivative  of  this  information.  Both  "information-storage" 
and  "function"  can  be  accomplished  using  a  myriad  of  chemical  and  physical  formats. 
However,  the  requirements  of  information  storage  and  duplication  using  chemical 
polymers  are  in  many  ways  contradictory  to  the  requirements  for  replication- 
enhancing  functions  (Benner  and  Switzer,  1999): 

•  Changes  in  the  information  must  not  impede  its  ability  to  be  duplicated,  whereas 
changes  in  information  must  allow  changes  in  function,  and  thus,  physical 
properties.  The  selectable  traits  (function)  of  a  polymer  are  determined  by  the 
three  dimensional  structure  (fold)  of  the  polymer  and  the  positioning  of  chemical 
moieties,  thus  a  functional  polymer  must  fold  to  generate  selectable  chemical 
traits.  In  contrast,  an  information  storage  system  should  not  fold  easily  as  this  is 
expected  to  impede  generalized  mechanisms  for  duplication. 

•  An  information  storage  system  should  have  few  subunits  to  ensure  greater  fidelity 
of  information  transfer,  whereas  a  functional  polymer  should  have  many  subunits 
with  diverse  functionalities  to  enhance  the  chemical  potential  of  the  polymer. 

•  For  a  system  to  explore  new  function  through  evolution,  chemical  properties  must 
be  able  to  change  rapidly  (with  few  changes)  and  abruptly.  Such  abrupt  changes 
in  the  chemical  properties  of  a  molecule  following  mutation  are  contradictory  to 
the  requirement  that  the  information  carrying  system  maintain  its  ability  to  be 
replicated  in  spite  of  significant  change  (mutation). 

Chemical  Challenges  to  the  Prebiotic  Synthesis  of  "Life" 
For  a  chemically  based  living  system  (which  is  the  only  undisputed  form  of  life 
we  know),  it  is  difficult  to  imagine  a  mechanism  for  storing  information  and  generating 
phenotypes  other  than  through  the  structure  of  polymers.  Given  this  restriction,  a  number 


of  requirements  must  be  met  for  the  prebiotic  (pre-Darwinian)  genesis  of  a  living 
(evolving)  system: 

(a)  Prebiotic  synthesis  of  the  monomers. 

(b)  Prebiotic  formation  of  polymers  from  monomers. 

(c)  Obtaining  catalysts  from  a  library  of  random  polymers. 

(d)  Genetics  -  obtaining  a  polymer  that  acts  as  a  heritable  carrier  of  information  for 
the  useful  catalysts. 

(e)  Compartition  -  linking  the  information  storing  system  (genetics)  to  the  catalytic 
function(s)  it  encodes. 

In  modern  biology,  the  contradiction  between  the  requirements  for  a  genetic 
biopolymer  and  a  catalytic  biopolymer  (see  above)  is  ameliorated  by  (mostly)  separating 
the  functions  of  genetics  and  catalysis  into  two  different  bio-polymers.  Nucleic  acid  is 
the  information  storing  biopolymer,  which  in  turn  is  used  to  produce  proteins  which 
generate  most  of  the  physical  properties  that  are  the  subject  of  natural  selection. 
Although  the  division  of  labor  between  two  distinct  bio-polymers  resolves  the 
contradictory  demands  of  a  life-form,  it  creates  a  significant  paradox  pertaining  to  the 
origins  of  life 

In  light  of  the  challenges  (a)  and  (b)  to  the  prebiotic  creation  of  life,  it  is  difficult 
to  imagine  a  non-biological  mechanism  for  generating  either  of  the  contemporary  bio- 
polymer systems  (nucleic  acid  or  protein).  It  is  astronomically  more  improbable  that  both 
bio-polymer  systems  would  arise  simultaneously  and  spontaneously,  and  furthermore  in  a 
manner  that  would  allow  their  mutual  integration/compatibility.  In  the  1960s,  theoretical 
consideration  of  this  dilemma  led  to  the  hypothesis  that  life  must  have  originated  as  a 
single  biopolymer  systems  (Rich,  1962).    This  proposal  requires  only  one  successful 
solution  to  the  challenge  of  obtaining  useful  monomers  and  polymers.  Furthermore,  for  a 
single  gene/enzyme  system,  the  challenge  of  compartition  is  resolved  because  the  two 


functional  requirements  are  held  simultaneously  in  a  single  molecule  (the  challenge  of 
compartition  remains  when  the  system  under  selection  contains  multiple  gene/enzymes). 
This  solution,  however,  creates  another  challenge:  Such  a  biopolymer  must  meet  the 
contradictory  challenges  of  (c)-catalysis  and  (d)-genetics  simultaneously. 

Progress  Towards  Understanding  the  Genesis  of  Life:  Dual  Function  Biopolvmers 
Two  approaches  have  since  been  pursued  to  gain  insight  about  how  to  meet  the 
challenges  to  the  origins  of  life,  one  which  begins  with  modern  life-forms  and  works 
backwards  to  deduce  information  about  its  origins,  while  the  other  approach  begins  with 
prebiotic  chemistry  and  attempts  to  work  forwards  towards  life. 

Much  insight  has  been  deduced  about  early  life  from  an  analysis  of  its  modern 
descendents.  Although  modern  life-forms  have  nucleic  acids  specialized  for  genetics  and 
proteins  specialized  for  catalysis,  there  exist  many  idiosyncrasies  about  modern 
metabolism  that  are  suggestive  of  a  previous  incarnation  as  a  single-biopolymer  life-form 
based  on  RNA:  most  biological  coenzymes  are  based  on  nucleotides,  nucleic  acids  are 
used  to  the  generate  proteins;  histidine  is  biosynthesized  from  nucleotides 
(phosphoribosyl  pyrophosphate  and  ATP);  RNA  is  a  key  component  of  the  spliceosome, 
ribosome,  and  RNAseP;  RNA  serves  as  the  primer  for  DNA  synthesis;  DNA  is 
synthesized  by  protein-based  reduction  of  RNA  (a  chemical  transformation  that  is  quite 
difficult,  and  unlikely  to  occur  via  prebiotic  mechanisms)  (Joyce,  1989;  Benner,  1989). 
The  use  of  RNA  for  chemical  purposes  for  which  it  is  not  intrinsically  suited,  particularly 
in  the  modern  environment  where  chemically  better-suited  molecules  could  be  made 
(proteins),  argues  against  convergent  evolution  and  instead  suggests  that  these  RNA 


vestiges  are  "molecular  fossils"  leftover  from  a  time  in  history  when  RNA  was  the  sole 
biopolymer  (as  opposed  to  convergent  evolution)  (Gilbert,  1986).  These  considerations 
offer  support  for  the  previous  theoretically-based  proposition  that  modern  life-forms 
passed  through  a  single  biopolymer,  RNA-based  life-form,  a  period  of  time  known  as  the 
"RNA  world"  (Woese,  1967;  Crick  1968;  Orgel,  1968).  Reconstructions  of  the  RNA 
world  indicate  that  RNA  life-forms  possessed  quite  complex  metabolism:  molecular 
fossils  indicate  that  the  RNA  world  developed  the  RNA  cofactors  ATP,  coenzyme  A,  S- 
adenosylmethionine,  and  NADH.  lit  follows  that  the  RNA  world  needed  these, 
presumably  for  phosphorylations,  Clasisen  condensations,  methyl  transfers  and 
oxidation-reduction  reactions  (Benner,  1989). 

The  discovery  that  modern  life  contains  RNA  that  acts  as  a  catalyst  demonstrated 
that  RNA  is  in  fact  a  single  bio-polymer  capable  of  both  genetics  and  catalysis,  thus 
giving  experimental  support  to  the  proposition  that  modern  two-biopolymer  life  is 
descendent  from  a  single-biopolymer  (RNA)  life-form  (Usher,  1976;  Cech,  1981, 
Geurrier-Takada,  1983).  The  existence  of  an  "RNA  world"  does  not  necessitate, 
however,  that  life  actually  originated  with  RNA.  Indeed,  applying  chemical  knowledge 
to  "work  forward"  from  prebiotic  earth  to  modern  life  suggests  that  ribonucleic  acids  may 
have  been  a  very  poor  candidate  as  the  primordial  monomer  in  the  origins  of  life. 
Instead,  the  RNA  world  may  have  served  as  a  transitory  life-form  between  the  original 
single-polymer  life-form  and  today's  robust,  two-biopolymer  life-form.  We  now  turn  our 
attention  to  insights  gained  by  "working  forward"  from  basic  chemical  principles. 


Progress  Towards  Understanding  the  Genesis  of  Life:  Prebiotic  Synthesis 
of  Potentially  Useful  Monomers  and  Polymers 

The  challenge  of  non-biological  synthesis  of  useful  monomers  and  polymers 
argues  against  the  simultaneous  creation  of  two  interrelated  polymeric  systems,  but  the 
creation  of  even  a  single  polymeric  system  for  life  (capable  of  genetics  and  catalysis)  still 
must  meet  the  need  for  prebiotic  synthesis  of  monomers  and  polymers.  Knowing  that 
chemically  based  life  does  exist  we  must  assume  that  monomers  and  polymers  did  form 
in  the  prebiotic  world  as  result  of  the  basic  principles  of  chemistry.  It  has  not,  however, 
been  easy  to  deduce  how  this  might  occur. 

In  light  of  the  challenges  to  the  prebiotic  generation  of  life,  what  is  so  surprising 
about  the  origins  of  life  (other  than  that  it  happened  at  all)  is  that  it  appeared  relatively 
rapidly.  The  earth  was  formed  -4.6  billion  years  ago  (bya).  The  first  -0.5  billion  years 
of  earth's  life,  prior  to  the  cooling  of  the  earth's  crust  and  the  subsiding  of  severe 
meteorites  bombardment,  was  inhospitable  to  the  formation  of  any  useful  monomers,  yet 
evidence  of  life  in  the  fossil  record  is  apparent  as  early  as  3.6  bya  (Walter,  1983).  This 
provides  a  very  short  time  for  the  generation  of  monomers,  polymers  and  subsequent 
genesis  of  life. 

In  order  to  apply  chemical  principles  to  gain  insight  into  the  prebiotic  creation  of 
monomers,  the  prebiotic  environment  of  the  earth  (approximately  4  billion  years  ago) 
must  be  understood.  Although  definitive  information  about  the  earth's  early  environment 
and  composition  is  slow  coming,  it  is  widely  assumed  that  H2O,  CO2,  H2,  and  CO 
existed.  In  a  groundbreaking  experiment,  Stanley  Miller  demonstrated  that  combining 
these  basic  elements  with  an  electric  discharge  (presumably  also  occurring  with  sufficient 
regularity  on  the  prebiotic  earth)  produced  a  prebiotic  soup  containing  a  mixture  of  key 


materials,  most  notably  HCN  (Miller  and  Urey,  1959;  Sclesinger  and  Miller,  1983; 
Stibling  and  Miller,  1987).  The  prebiotic  presence  of  HCN  is  further  supported  by  its 
presence  in  intrastellar  clouds,  in  comets,  and  in  the  atmosphere  of  Titan  (Irvine,  1999; 
Huebner,  et  al,  1974;  Hanel,  1981). 

Base-catalysed  tetramerization  of  HCN  yields  diaminomaleonitrile,  which  then 
leads  to  the  production  of  adenine,  hypoxanthine,  guanine,  xanthine,  and  diaminopurine. 
Hydrolysis  of  HCN  oligomers  yields  amino  acids,  particularly  glycine,  alanine,  aspartate, 
and  diaminosuccinate,  among  other  amino  acids  (reviewed  in  Joyce,  1989).  Other 
electric  discharge  experiments  under  presumably  prebiotic  conditions  generated  purines, 
among  other  potentially  useful  materials.  These  experiments  usually  have  low  yields  of 
pyrimidine  synthesis,  but  recent  experiments  of  Miller  demonstrate  that  significant 
pyrimidine  yield  from  frozen  dilute  samples  of  NH4CN  (Miyakawa,  et  al  2002). 

Another  prebiotic  molecule,  H2CO,  can  lead  to  glyceraldehydes  through 
condensation  in  the  presence  of  a  catalyst  such  as  calcium  carbonate  or  alumina  (Gabel, 
1967;  Reid,  1967).  Glyceradehyde  can  then  begin  a  cascade  that  converts  formaldehyde 
into  trioses,  tertroses,  and  larger  sugars  through  a  process  of  aldol  condensation  and 
enolization  (Joyce,  1989). 

Joining  nucleoside  and  sugar  subunits  has  been  accomplished  by  heating  a 
mixture  of  ribose  and  purine  to  dryness  in  the  presence  of  inorganic  salts;  this  process, 
however,  leads  to  a  mixture  of  a-  and  P-nucleosides  (Fuller,  et  al,  1972). 

In  summary,  experimental  evidence  suggests  that  the  chemical  environment  and 
composition  of  the  prebiotic  earth  is  sufficient  to  provide  the  necessary  mechanisms  and 
materials  for  the  synthesis  of  many  potentially  useful  monomers.  These  findings  are 
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supported  by  the  discovery  of  similar  monomers  on  meteorites,  which  were  also 
presumably  synthesized  by  prebiotic  mechanisms  in  space.  The  next  step  towards  a 
living  system  requires  that  monomers  be  linked  together  by  prebiotic  chemistry  to  form 
polymers. 

Limited  polymerization  of  nucleic  acids  has  been  observed  under  prebiotic 
conditions.  Adenosine  2'-,  3 '-cyclic  phosphate  can  be  converted  to  oligomers  by  heating 
to  dryness,  or  by  incubating  at  room  temperature  in  the  presence  of  a  polyamine 
(Verlander,  1973).  Nucleoside  5 '-polyphosphates  have  also  been  produced  by  heating 
nucleotides  and  inorganic  polyphosphates  to  dryness  (Lohrmann,  1975).  The  random 
polymerization  of  monomers,  however,  remains  a  great  challenge,  specifically  in  light  of 
the  abundance  of  stereo-chemically  related  compounds  that  are  expected  to  interfere  with 
prebiotic  polymerization.  Taking  ribose-based  nucleotides  as  an  example,  nucleotide 
polymerization  can  result  in  a  combinatorial  mixture  of  2'-,  5'-,  3'-,  5'-,  and  5'-,  5'- 
phosphodiester  bonds,  with  variable  numbers  of  phosphates  between  the  sugar  bases, 
combined  with  the  D-  and  L-  sugar  stereoisomers  and  a-  and  p-anomers  of  the  glycosidic 
bonds.  Most  polymers  generated  are  expected  to  have  heterogeneous  linkages,  which  is 
expected  to  make  replication  of  the  polymer  exceedingly  more  difficult.  Several 
mechanisms  may  have  existed  to  overcome  stereoisomeric  interference,  generally  relying 
on  some  mechanism  of  enriching  for  specific  isomeric  forms  of  monomers  or  preferential 
incorporation  of  certain  monomeric  isoforms.  Although  prebiotic  formation  of  both 
monomers  and  polymers  is  likely  to  result  in  a  clutter  of  stereoisomers,  a  series  of  biased 
synthesis  followed  by  fractionations  or  enrichments  may  have  provided  sufficient 
building  blocks  for  creating  life. 
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It  is  interesting  to  note  that  not  all  (potentially)  prebiotic  monomers  suffer  as 
many  of  the  stereoisomeric  challenges  as  ribose-based  nucleotides.  Both  threose  nucleic 
acid  (TNA)  and  peptide  nucleic  acid  (PNA)  use  alternative  bases  attached  to  nucleic 
acids.  Both  monomers  are  more  chemically  simplistic,  thus  avoiding  much  of  the 
stereoisomeric  clutter  of  RNA.  This  has  led  to  proposals  of  a  pre-RNA  world  utilizing 
either  polymer  as  potentially  life-supporting  polymers.     Theoretical  and  experimental 
evidence  suggest  that  information  stored  in  the  form  of  TNA  or  PNA  polymers  can  be 
transferred  to  RNA,  although  it  is  unclear  how  much  phenotypic  function  is  preserved  in 
the  transfer  (reviewed  in  Joyce,  2002). 

The  Next  Challenge:  Obtaining  Catalytic  Function  from  Random  Libraries 
Although  monomer  and  polymer  synthesis  is  expected  to  be  difficult  in  a  prebiotic 
world,  it  is  the  formation  of  polymers  that  act  as  genetic  molecules  with  useful  chemical 
properties  that  appears  most  formidable  (useful  from  the  perspective  that  it  leads  to  the 
creation  of  life:  an  initially  useful  property  would  be  the  ability  to  enrich  for  specific 
monomers).  This  requires  that  polymers  be  of  sufficient  length  to  contain  sufficient 
information,  be  sufficiently  stable  as  to  persist,  be  able  to  mutate  yet  still  be  replicated, 
and  lastly,  perform  useful  chemical  function. 

Although  we  do  not  know  precisely  what  monomers  were  available  in  a  priobiotic 
world  or  how  they  were  synthesized  and  polymerized,  it  is  clear  that  at  some  point  ~  4 
bya  some  randomly  generated  prebiotic  polymers  must  have  possessed  "useful" 
properties  (presumably  a  self-replicase)  that  permitted  the  jump  from  a  collection  of 
random  polymers  to  a  single-biopolymer  life-form  with  a  complex  metabolism. 
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Although  this  is  the  most  reasonable  hypothesis,  it  is  difficult  to  imagine  that  such  a 
complex  function  could  be  generated  from  a  random  polymer.  An  experimental 
approach,  in  vitro  selection  (IVS),  emerged  in  the  1980s  that  served  as  a  bridge  between 
the  "work  forward"  and  "work  backward"  approaches  by  permitting  a  direct  test  of  the 
catalytic  power  of  random-polymer  libraries  (Szostak,  1992;  Beaudry,  1992;  Irvine, 
1991). 

IVS  begins  with  a  library  containing  ~1013  unique  and  randomly  generated  polymers  of 
DNA.  This  library  is  then  challenged  by  a  selection  process  (such  as  the  ability  to 
catalyze  a  particular  reaction  or  bind  a  particular  molecule).  Molecules  that  can  perform 
the  selected  task  are  separated  from  those  that  do  not  (Figure  1-1).  Finding  molecules  in 
a  random  library  that  perform  the  selected  task  is  therefore  a  function  the  abundance  of 
that  activity,  and  the  ability  of  the  selection  regime  to  separate  active  from  inactive 
molecules. 


Combinatorial  library 
of  DNA  molecules 


Solid  support 


Random  region 


n 


HL 


F  LLLL-LI '  '  ■  1 1 1 1 1 1  1 1 1 1 1 J  1 1 1 1 1 1  t  [  1 1  n  I 


U 

Denaturation  and  elution 
of  complementary  strand 
.3' 


5' 


Binding  of  PCR 
products  to  solid 
support 





Reaction  Buffer 
vCofactor  (Mg++) 


Cleavage  site 


i      inn  ^ 


Catalytic  Motif 


PCR  amplification 


-B 


5- 


Elution  of  cleaved 
Molecules 


3' 


Figure  1-1.  A  schematic  diagram  describing  in  vitro  selection  for  a  DNA  molecule 
that  catalyzes  a  cleavage  reaction. 
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In  practice,  the  selection  process  is  not  perfect,  allowing  inactive  molecules  to 
survive  (and  presumably  losing  some  active  molecules).  When  active  molecules  are  rare 
and  background  is  high,  several  sequential  selections  are  required  to  enrich  for  active 
molecules.  Experimentally,  IVS  accomplishes  sequential  rounds  of  selection  by  using 
the  surviving  molecules  from  the  first  round  of  selection  as  a  template  for  PCR,  thus 
amplifying  the  survivors  and  creating  a  second  pool  of  DNA  molecules  (that  are 
presumably  enriched  for  catalysts)  for  a  subsequent  round  of  selection. 

This  experimental  strategy  is  a  close  analogy  to  what  is  imagined  to  have  led  to 
the  first  life-form  (with  the  exception  that  prebiotic  mechanisms,  as  opposed  to  human 
intervention,  lead  to  the  development  of  the  polymer  library,  and  "selection  for  desired 
activity"  was  imposed  by  the  Earth's  environment  and  processes  which  lead  to  the 
decomposition  of  polymers).    It  was  anticipated  that  IVS  would  now  demonstrate  that 
the  chemical  processes  necessary  for  life  could  in  fact  be  found  in  a  prebiotic  library  of 
polymers.  After  a  decade  of  experiments  and  great  effort,  many  catalysts  have  indeed 
been  isolated  from  random  libraries,  most  notably  a  phosphodiesterase,  a  Diels-Alderase, 
a  nucleotide  ligase,  and  a  limited  polymerase  (Breaker  and  Joyce,  1994;  Johnston,  2001; 
Seeling,  1999;  Tarasow,  1997). 

Despite  these  successes,  getting  self-replicating  nucleic  acid  polymers  from  IVS 
proved  harder  than  anticipated/hoped.  Bartel  and  others  are  coming  close  to  achieving 
this  goal  (Johnston,  2001).  It  must  be  noted,  however,  that  even  the  authors  of  this 
research  contribute  the  success  of  the  de  novo  generation  of  an  RNA  based  RNA 
polymerase  to  a  stepwise  procedure  guided  by  human  chemical  knowledge:  "Thus  far, 
efforts  to  select  for  polymerization  activity  in  a  single  step  directly  from  random-sequence 
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RNA  have  yielded  only  ribozymes  that  decorate  themselves  inappropriately  with  tagged 
nucleotides"  (Johnston,  2001,  page  1324).  Furthermore,  estimates  from  their  work 
suggest  that  more  than  10^0  random  RNA  sequences  might  be  required  to  get  template- 
directed  RNA  polymerization  from  a  truly  random  pool.  More  directly,  Bartel  and  Szostak 
noted  that  the  odds  of  obtaining  a  ligase  able  to  enhance  a  very  simple  templated  ligation 
reaction  by  four  orders  of  magnitude  were  on  the  order  of  one  in  10^  (Bartel,  1993). 

As  mentioned  earlier,  it  is  not  certain  that  life  originated  from  RNA  polymers, 
indeed  many  other  candidate  polymers  are  imaginable.  In  support  of  this,  it  is  notable 
that  catalytic  activity  has  been  isolated  from  various  different  types  of  libraries,  including 
amino  acid,  ribonucleic  acid,  and  deoxynucleic  acid  polymers.  Extrapolations  from  a 
search  of  amino  acid  libraries  estimate  that  in  a  fully  randomized  library,  a  library  of  10 
members  would  be  required  to  obtain  an  active  catalyst  (in  that  case,  AroQ  mutase) 
(Taylor,  et  al  2001).  This  search  of  amino  acid  libraries  concluded  that,  although  the 
diversity  of  functionality  in  proteins  provides  intrinsically  greater  catalytic  potential, 
folding  proteins  may  be  inherently  more  difficult  than  folding  nucleic  acid  polymers, 
therefore  making  an  active  enzyme  from  a  random  polymer  of  amino  acids  appears  to 
substantially  more  difficult  that  obtaining  a  ribozyme  from  a  random  nucleic  acid  library. 

Although  natural  amino  acid  polymers  (proteins)  function  superbly  as  catalysts, 
and  amino  acid  catalysts  have  been  isolated  from  random  libraries,  amino  acid  polymers 
are  not  well  suited  as  carriers  of  genetic  information  for  reasons  described  earlier.  In 
vitro  selections  using  amino  acid  libraries  are  also  much  more  technologically  limited 
than  searches  of  nucleic  acid  sequence  space.  For  all  these  reasons,  and  the  indication 
that,  although  life  on  earth  may  not  have  started  with  RNA,  it  clearly  passed  through  a 
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one-biopolymer  stage  based  on  RNA,  in  vitro  selections  have  largely  focused  on  nucleic 
acid  biochemistry. 

More  than  a  decade  ago,  Benner  proposed  what  has  come  to  be  the  standard 
explanation  for  the  "poor"  catalytic  power  of  nucleic  acids  (where  "poor"  reflects  in  part 
an  unbalanced  comparison  with  natural  protein  enzymes)  (Benner,  1987;  Benner,  1988b). 
Nucleic  acids,  he  argued,  have  little  organic  functionality,  at  least  compared  to  natural 
protein  enzymes.  DNA  lacks  in  the  scaffold  cationic  groups,  imidazoles,  thiols,  and 
carboxylates,  all  of  which  play  effective  catalytic  roles  in  protein  enzymes  near  neutral 
pH.  To  relieve  these  limitations,  Benner  suggested  expanding  the  genetic  alphabet  for  the 
purpose  of  permitting  DNA  to  carry  more  of  these  functional  groups  (see  Switzer,  1989; 
Piccirilli,  1990). 

This  hypothesis  gains  support  from  the  observation  that  many  of  the  catalytic 
RNAs  believed  to  be  vestiges  of  the  RNA  world  (spliceosome,  RNaseP,  ribosome)  contain 
post-transcriptional  modifications;  the  conservation  of  this  difficult  task  both  throughout  4 
billion  years  of  evolution  and  across  wide  phylogenic  taxa  suggests  that  these 
modifications  must  play  a  critical  role  in  the  function  of  these  catalytic  RNA.  Evidence 
for  the  prebiotic  synthesis  of  C-5  functionalized  uracil  -  including  ammonia,  glycine, 
guanidine,  hydrogen  sulfide,  hydrogen  cyanide,  imidazole,  indole  and  phenol  -  also  lends 
credence  to  the  hypothesis  that  functionalities  appended  to  nucleic  acids  may  have  served 
a  crucial  role  in  the  origins  of  catalytic  nucleic  acids  and  subsequent  life  (Robertson  and 
Miller,  1995). 

Several  groups  (Battersby,  1999;  Latham,  1994;  Sakthivel,  1998;  Perrin,  1999; 
Perrin,  2001)  have  now  synthesized  functionalized  nucleic  acids,  mostly  derivatives  of  2'- 
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deoxyuridine,  and  used  them  in  IVS  experiments  in  an  effort  to  demonstrate  that  adding 
functionality  increases  the  odds  of  finding  catalytically  powerful  sequences  in  a  DNA 
library.  Judging  from  published  examples,  it  seems  to  be  now  accepted  that  adding 
functionality  typically  increases  the  catalytic  potential  of  a  DNA  library  perhaps  by  a 
factor  of  two  to  ten,  not  by  orders  of  magnitude. 

In  a  recent  example,  Joyce  and  co-workers  selected  a  zinc-dependent  RNAse 
activity  utilizing  an  imidazole-functionalized  nucleotide  in  their  library  (Santoro,  2000). 
No  significant  catalytic  boost  was  detected,  however,  over  DNAzymes  isolated  earlier  by 
the  same  researchers  using  non-functionalized  nucleotides  (Santoro,  1997).  Together  with 
other  similar  findings,  this  suggested  that  functional  endowment  offers  less  than  an  order 
of  magnitude  enhancement  of  the  catalytic  potential  of  the  library  (Ang,  1999;  Ang, 
2000).  Analogous  experiments  where  functionality  was  introduced  as  a  cofactor,  binding 
to  but  not  covalently  attached  to  the  DNA  molecules,  generated  analogous  conclusions 
(Roth,  1998). 

What  appears  to  be  absent  from  the  literature  is  the  sense  of  astonishment  at  these 
results.  Functional  groups  are  supposed  to  be  important  in  organic  reactivity.  For 
example,  in  an  experiment  in  which  a  polypeptide  was  generated  that  folded  in  solution 
and  catalyzed  the  decarboxylation  of  oxaloacetate  (Johnsson,  1990;  Johnsson,93),  the 
amino  groups  performed  the  catalysis.  Having  them  mattered;  not  having  them  mattered 
even  more,  as  catalysis  went  down  by  several  orders  of  magnitude  (to  background)  when 
they  were  removed.  In  chemical  models,  adding  imidazole  (for  example)  increases 
reactivity  of  catalytic  models  by  orders  of  magnitude,  not  by  a  factor  of  two  to  ten  (Zepik, 
1999). 
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When  organic  chemical  theory  expects  a  large  (orders  of  magnitude)  impact  of  a 
change  of  structure  on  molecular  behavior,  and  only  a  small  impact  is  seen  (a  factor  of  2 
to  10),  something  must  be  wrong.  Therefore,  this  "orders-of-magnitude"  difference  in 
expectation  versus  outcome  needs  to  be  pursued.  Perhaps  the  most  formidable  challenge 
to  our  understanding  of  the  origins  of  life  lies  in  the  experimental  indication  that  function 
is  excessively  rare  in  random  polymer  libraries.  Therefore,  if  a  plausible  model  for  the 
origin  of  life  is  ever  to  emerge  based  on  an  "RNA- world"  scenario,  we  need  to  understand 
why  functionality  does  not  seem  to  significantly  enhance  the  catalytic  power  of  the 
library. 

Although  a  key  obstacle  in  the  origins  of  life  is  the  creation  of  function  from 
random  libraries,  none  of  the  IVS  experiments  conducted  thus  far  have  been  designed  to 
directly  estimate  the  frequency  of  function  within  a  library,  or  determine  directly  how  this 
frequency  changes  with  various  types  of  libraries  and  under  varying  selection  schemes. 
Most  in  vitro  selections  are  designed  to  determine  if  a  desired  activity  exists  in  a  library, 
and  therefore  are  content  to  isolate  a  few  molecules  with  the  desired  activity.  The 
previously  used  methods  of  estimating  the  frequency  of  catalysts  in  a  library  are  not 
comprehensive  (Johnston,  2001,  Bartel,  1993,  Taylor,  2001) ,  thus  the  few  estimates  of 
catalytic  frequency  provided  are  not  to  be  considered  realistic.  Indeed  it  is  difficult  to 
imagine  life  emerging  spontaneously  from  nucleic  acids  given  these  kinds  of  numbers, 
especially  given  the  difficulties  in  obtaining  RNA  (or  DNA)  in  polymeric  form  under  any 
plausible  prebiotic  reaction  conditions  in  water.  Initially,  the  limited  goal  of  isolating  any 
activity  from  a  library  was  understandable  given  that  it  was  not  known  whether  any  such 
activity  existed  in  a  random  library.  Knowing  that  activity  can  be  isolated  from  random 
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libraries,  the  next  critical  task  of  estimating  the  frequency  of  such  activity  in  a  random 
library,  and  determining  what  factors  alter  catalytic  abundance  and  power,  can  be 
addressed.  Understanding  this  is  the  next  vital  step  to  understanding  how  to  make  the 
transition  from  prebiotic  polymers  to  a  living  system. 

In  order  to  understand  how  a  single-biopolymer  life-form  is  created  by  prebiotic 
mechanisms,  we  must  understand  the  factors  that  alter  the  probability  of  creating  function 
from  random  polymers,  specifically  the  apparent  contradication  that  added  functionality 
doesn't  appear  to  significantly  improve  catalysis.  Five  categories  of  hypotheses  might  be 
considered  to  account  for  this  orders-of-magnitude  difference  between  expectation  and 
outcome  when  adding  functionality: 

(a)  Our  view  of  the  role  of  functionality  in  catalysis  may  be  naive.  Functional  groups 
may  not  greatly  enhance  catalytic  power. 

(b)  Nucleic  acids  are,  of  course,  already  functionalized  (with  phosphates  and  hydrogen 
bond  donating  and  accepting  groups).  Perhaps  these  are  the  only  functionalities 
needed  for  catalysis. 

(c)  Nucleic  acids,  although  they  do  not  have  many  functional  groups  intrinsically  in 

their  covalent  structure,  may  recruit  sufficient  numbers  of  these  (in  particular, 

++ 
divalent  cations  such  as  Mg    )  as  "cofactors"  to  make  up  for  the  lack  of 

functionality  covalently  linked  to  the  scaffolding. 

(d)  The  IVS  experiments  may  have  been  designed  in  a  way  as  to  overlook  or  lose  the 
best  functionalized  catalysts. 

(e)  Perhaps  adding  a  single  type  of  functional  group  is  not  sufficient;  one  may  need  to 
add  two  or  more  types  of  functional  groups  before  the  expected  large  benefit  from 
functionality  is  seen  (Perrin,  1999;  Perrin,  2001). 

A  priori,  each  hypothesis  has  its  attractions  and  disadvantages.  For  example,  we 
can  easily  identify  ways  in  which  the  best  endowed  catalysts  would  be  lost  in  a  Breaker- 
Joyce  protocol  (see  below),  suggestive  of  hypothesis  (d).  Divalent  metal  ions  do  bind  to 
DNA,  they  can  participate  in  catalysis,  and  they  assist  even  more  highly  functionalized 
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protein  catalysts,  suggestive  of  hypothesis  (c).  Proteins  do  seem  to  exploit  multiple  types 
of  functionality,  and  exploit  hydrophobicity,  something  lacking  in  aliphatic  form  in  DNA, 
suggestive  of  hypothesis  (e).  Last,  recent  experiments  by  Keefe  and  Szostak  selecting  for 
proteins  that  do  things  suggests  that  they  do  not  do  things  orders  of  magnitude  better  than 
DNA  (2001),  suggestive  of  hypothesis  (a)  or  (b). 

Experimental  Design 

This  thesis  presents  the  first  step  towards  addressing  the  hypotheses  above.  We 
utilized  the  experimental  procedure  developed  by  Breaker  and  Joyce  (1994,  1995).  This 
IVS  procedure  seeks  DNA  molecules  that  are  catalytically  active  as  ribonucleases;  this 
protocol  is  the  most  widely  utilized  procedure  for  estimating  catalytic  performance  under 
various  conditions,  allowing  general  comparisons  such  as  the  ones  mentioned  before  that 
suggest  functionality  does  not  significantly  improve  catalytic  potential.  As  mentioned 
above,  one  explanation  for  the  apparent  failure  of  functional  endowment  to  generate  very 
active  deoxyribozymes  may  result  from  the  loss  of  highly  reactive  catalysts  during  some 
step  in  the  experiment  before  they  had  the  opportunity  to  survive  the  selection  step. 

It  is  easy  to  find  places  in  the  set-up  of  the  Breaker- Joyce  selection  procedure 
where  highly  active  catalysts  might  be  lost.  Standard  IVS  experimental  systems  assume 
that  catalysts  remain  inactive  during  the  work  up  procedure,  which  includes:  PCR  (with 
MgCb),  ethanol  precipitation  (with  high  salts),  binding  to  a  column  (again  in  high  salts, 
but  without  MgCb)  and  removing  the  complementary  strand  with  high  pH  sodium 
hydroxide.  After  work  up  is  completed,  the  reaction  is  "started"  with  the  addition  of 
MgCh  and  the  eluate  from  a  column  is  collected  and  presumed  to  contain  the  active 
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catalyst.  There  are  several  technical  weaknesses  of  the  Breaker- Joyce  IVS  protocol  that 
may  result  in  the  loss  of  desired  catalysts. 

First,  columns  have  poor  resolving  ability,  in  part  because  they  have  great 
potential  for  residual  retention  (unbound  material  slowly  filtering  through  the  column) 
and  leakage  (bound  material  becoming  unbound).  It  is  also  assumed  that  even  the  "best" 
catalysts  remain  inactive  until  the  reaction  is  "started"  by  adding  MgCb  (or  other 
cofactor,  such  as  histidine)  and  collection  begins.  Magnesium  is  in  fact  present  during 
the  preparation,  including  during  the  PCR,  and  other  salts  are  also  present  during  the 
ethanol  precipitation  -  potentially  permitting  catalysis  during  the  preparation  steps  and 
resulting  in  loss  of  catalysis.  DNAzymes  have  been  isolated  using  the  Breaker- Joyce 
protocol  that  are  active  without  any  added  cofactor  (requiring  only  NaCl),  offering 
support  to  the  hypothesis  that  active  catalysts  may  be  lost  during  the  workup. 

Second,  the  activity  of  a  DNAzyme  is  inhibited  by  the  complementary  strand,  and 
therefore  must  be  removed.  This  is  accomplished  in  the  Breaker- Joyce  protocol  by 
denaturing  the  double-stranded  DNA  with  a  strong  base  solution  (NaOH),  and  then 
washing  the  non-biotinylated  complementary  strand  from  the  column.  Strong  base  can 
itself  cause  sequence  independent  cleavage  of  RNA,  and  is  known  to  increase  the  rate  of 
many  DNAzymes.  It  is  therefore  possible  that  many  DNAzymes,  in  particular  the  fastest 
DNAzymes,  are  lost  through  this  base-catalyzed  cleavage  of  RNA.  The  strong  base  may 
also  destabilize  the  binding  of  the  catalysts  via  biotin  to  the  streptavidin  column,  resulting 
in  further  loss  of  potential  catalysts. 

Lastly,  DNAzyme  folding  is  largely  ignored.  After  flushing  out  the  NaOH  with 
reaction  buffer  (minus  the  "required"  cofactor),  the  reaction  is  "started"  by  the  addition  of 
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the  cofactor.  Correct  folding  is  assumed  to  occur  without  the  required  cofactor,  and 
quickly  relative  to  catalysis. 

Each  of  these  problems  arises  because  the  Breaker- Joyce  protocol  has  the  2'- 
hydroxyl  group  of  the  RNA  molecule  to  be  free  during  the  set-up.  To  resolve  these 
problems,  research  in  this  disseration  develops  techniques  to  eliminate  loss  of  catalysts 
during  set  up  by  using  a  caged-ribose.  Caged-ribose  was  synthesized  with  the  2'-hydroxyl 
group  protected  via  an  ortho-nitrobenzyl  protecting  group.  This  protecting  group  can  then 
be  removed  by  UV  photolysis  (Figure  1-2). 
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Figure  1-2.  Exposing  the  caged-ribose  converts  2'-orthonitrobenzyl 
protecting  group  to  the  standard  hydroxyl  group,  thus  generating  a  standard 
RNA  nucleotide. 
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Our  eventual  goal  is  to  estimate  catalytic  distribution  in  a  random  library,  and  use 
this  estimate  to  determine  which  factors  enhance  the  catalytic  power  of  a  library.  To  make 
this  estimate  will  require  a  realistic  mathematical  simulation  of  the  selection  system 
which,  in  turn,  requires  detailed  knowledge  of  the  catalytic  process.  Towards  these  goals, 
research  in  this  dissertation  seeks  a  detailed  understanding  of  why  the  phosphodiesterase 
activity  isolated  using  the  Breaker- Joyce  procedure  routinely  reaches  a  low  plateau.  It  is 
also  crucial  to  determine  whether  the  reaction  behaves  as  a  first  order  chemical  process  (as 
predicted). 

Conditions  for  the  use  of  caged-ribose  have  been  optimized,  allowing  reaction 
during  the  set  up  to  be  prevented.  Preliminary  selections  with  the  caged-ribose  indicate 
that  the  laser  photolysis  of  the  caged-ribose  creates  a  slight  increase  in  background 
cleavage,  both  experimental  and  simulated  selections  indicate  that  this  leakage 
dramatically  increases  the  number  of  rounds  of  selection  required  to  isolate  active 
catalysts.  Although  this  limits  the  utility  of  caged-ribose  for  selections,  the  background 
does  not  significantly  alter  kinetic  analysis,  allowing  its  use  for  studying  the  pre-activation 
requirements  of  DNAzymes,  such  as  folding. 

To  aid  in  the  development  of  realistic  models,  we  undertook  a  detailed  study  of  an 
individual  DNAzyme,  614.  This  revealed  a  number  of  significant  findings.  It  is  widely 
assumed  that  including  a  cofactor  (such  as  Mg++  or  other  divalent  cations)  into  the 
selection  buffer  will  result  in  the  selection  for  cofactor-dependent  catalysts.  Indeed,  the 
Breaker- Joyce  protocol  presumes  that  potential  catalysts  are  completely  inactive  without 
the  added  cofactor.  We  have  determined  that,  although  614  was  selected  in  the  presence 
of  Mg++ ,  614  retains  complete  catalytic  activity  in  the  absence  of  Mg^ .  It  is  also  widely 
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assumed  that  the  identity  of  the  constant  regions  (arbitrarily  chosen  by  researchers) 
surrounding  the  randomized  region  is  largely  irrelevant  to  the  catalytic  activity  eventually 
selected,  and  it  is  instead  only  the  catalytic  potential  of  the  random  region  that  is  being 
tested.  The  DNAzyme  614  was  selected  with  2  nucleotide  changes  to  the  standard 
Breaker- Joyce  constant  region;  detailed  analysis  of  614  revealed  that  614  can  act  in  both  a 
uni-molecular  and  bi-molecular  fashion.  This  may  be  a  direct  consequence  of  the  two 
nucleotide  changes  from  the  Breaker- Joyce  motif  (although  this  is  not  known  for  certain 
since  few  researchers  have  directly  tested  whether  the  isolated  DNAzymes  are  in  fact 
acting  only  in  a  uni-molecular  fashion).  This  has  given  us  insight  into  how  to  design  and 
simulate  future  IVS. 

We  have  also  isolated  several  catalysts,  some  of  which  are  close  relatives  of  614, 
615  and  616  isolated  in  a  previous  selection,  which  each  display  catalytic  activity.  This  is 
significant  in  two  aspects,  first  in  that  the  catalytic  activity  varies  significantly  with  only  a 
minimal  sequence  changes.  This  suggests  that  the  landscape  in  the  near  vicinity  (one  or 
two  nucleotides)  of  a  particular  active  molecule  contains  many  catalysts,  presenting 
opportunities  for  continuous  exploration  of  the  functional  landscape.  Second,  the 
abundance  of  near-relatives  that  retain  activity  suggests  that  most  estimates  of  the  catalytic 
frequency  may  be  gross  underestimations,  as  near-relatives  may  appear  identical  under  the 
methods  (restriction  digest)  used  to  distinguish  them. 
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CHAPTER  2 
MATERIALS  AND  METHODS 


Preparation  of  Precursor  DNAymes  via  PCR 
DNAzymes  for  kinetics  were  prepared  by  PCR  amplification  of  the  template 
(either  synthesized  by  IDT,  or  from  a  clone)  using  one  of  various  primers  to  generate  the 
catalytic  strand  and  one  of  various  primers  to  generate  the  complementary  strand.  All 
templates  had  a  common  5'  and  3'  constant  region  to  which  the  complementary  and 
catalytic  strand  primers  can  bind.  Between  the  primer  binding  sites  is  a  40  nucleotide 
region  of  varying  sequence: 

"library"  template  (complementary  strand):  5 '  -gtgccaagcttaccgtcac  (n40)  - 

GAGATGTCGCCATCTCTTCC 

"614" template:  5' -ctgcagaattctaatacgactcactataggaagacatggcgactctc- 

ACATCATGCGAGCACACGCAATAGCCTGATAAGGTTGGTAGTGACGGTAAGCTTGGCAC 

Two  sequence  variants  of  the  catalytic  strand  primer  were  commonly  used,  but  both 
bound  to  the  constant  region  of  the  various  templates: 
"ang  +  ribose"  catalytic  strand  primer: 

5 ' -      CTGCAGAATTCTAATACGACTCACTATrAGGAAGACATGGCGAC-TCTC ) 

"BJ  +  ribose"  catalytic  strand  primer: 

5 ' -F-GGGACGAATTCTAATACGACTCACTATrAGGAAGAGATGGCGACATCTC ) 

The  differences  between  the  "ang"  and  "BJ"  primers  are  underlined;  "-"  represents  an 
alignment  gap;  "N"  is  equimolar  concentrations  of  each  dNTP,  "F"  is  a  5 'Fluorescein; 
"rA"  is  a  ribo-Adenosine.  Primer  variants  were  also  used  in  which  the  ribo-adenosine 
was  replaced  with  either  deoxy-adenosine  ("ang  -  ribose"  and  "BJ  -ribose")  or  caged- 


24 


25 

ribose  ("ang  +  cage"  and  "BJ  +  cage").     The  complementary  strand  primer  sequence 
was  always  the  same  (except  "complementary  +5TAIL  2C9*G"  in  which  the  5'  G  was 
changed  to  T),  but  varied  in  non-coding  modifications  appended  to  the  5 '-end: 
"complementary  no  5'-P"  primer: 

(5' -GTGCCAAGCTTACCGTCA) 

"complementary  +  5'-Phosphate"  primer : 

(5' -P-GTGCCAAGCTTACCGTCA) 

"complementary  +  5'tail#l"  primer: 

(5' -GGTGGGTGGG-C18-GTGCCAAGCTTACCGTCA) 

"complementary  +5TAIL  2C9"  primer: 

( 5 ' -AAAAAAAAAAAAAAAAAAAA-c9-c9-GTGCCAAGCTTACCGTCA) 

"complementary  +5TAIL  C18#2"  primer: 

( 5 '- AAAAAAAAAAAAAAAAAAAA- C 1 8 - GTGCCAAGCTTACCGTCA ) 

"complementary  +5TAIL  C18#3"  primer: 

(5'-  AAAAAAAAAAAAAAA-C18-GTGCCAAGCTTACCGTCA) 

"complementary  +5TAIL  2C9*G"  primer: 

(5' -AAAAAAAAAAAAAAAAAAAA-c9-c9-TTGCCAAGCTTACCGTCA) 

"complementary  +5TAIL  3C9"  primer: 

( 5 '- AAAAAAAAAAAAAAAAAAAA- c 9 - c 9 - c 9 -GTGCCAAGCTTACCGTCA) 

"complementary  +5TAIL  4x2'OME"  primer : 

( 5 ' -AAAAAAAAAAAAAAAAAAAA-mUmUmUmU-GTGCCAAGCTTACCGTCA) 


"CI 8"  is  an  18-atom  hexaethyleneglycol-based  spacer,  "C9"  is  a  9-atom 
triethyleneglycol-based  spacer,  "mU"  is  a  2'0-methyl-uridine,  "P"  is  phosphate.  Typical 
conditions  for  a  100  uL  PCR  contained  up  to  1  ng  template,  100  nM  catalytic  strand 
primer,  100  nM  complementary  strand  primer,  100  uM  dNTPS  (either  standard  A,  G,  C, 
and  T,  or  a  "nonstandard"  mixture  in  which  thymidine  is  replaced  by  5-(3-aminoallyl)- 
2'deoxyuridine,  henceforth  referred  to  as  "E-base"),  10  mM  KC1,  20  mM  Tris-HCl  (pH 
8.8),  10  mM  (NH4)2S04,  2  mM  MgS04,  0.1%  Triton  X-100,  3-4  units  polymerase  (Taq 
or  Vent  exo-),  and  10  uCuries  alpha-32P-CTP  (for  internally  labeled  samples).  PCR 
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amplification  consisted  of  3  minutes  at  96°C;  followed  by  20  cycles  of  45  seconds  at 
96°C,  45  seconds  at  50°C,  and  2  minutes  at  72°C. 

PCR  for  in  vitro  selections  (IVS)  varied  slightly:  400  nM  of  the  catalytic  strand 
primer  was  used  (to  compensate  for  hairpins  formed  by  the  "BJ  +cage"  primer),  up  to  40 
cycles  of  PCR  were  used  in  the  early  rounds  of  IVS,  Vent  exo-  was  used  (rather  than  Taq 
because  it  incorporates  5' -functional ized  nucleotides  such  as  E-base  with  greater 
efficiency),  and  primer  binding  was  done  at  57°C  rather  than  50°C  (again,  to  compensate 
for  hairpins  formed  by  BJ  primers). 

Point  mutations  introduced  within  in  the  catalytic  strand  primer  region  but  3 '  of 
the  ribose-adenosine,  were  incorporated  using  a  two  step  PCR.  In  the  first  step,  1 0  cycles 
of  PCR  were  conducted  in  which  the  catalytic  strand  primer  was  replaced  with  one 
containing  the  desired  point  mutation  and  a  5  nucleotide  truncation  from  the  5 'end.  The 
product  of  this  PCR  was  used  as  the  template  (diluted  1:10)  for  a  second  PCR  containing 
a  catalytic  strand  primer  truncated  two  nucleotides  3'  of  the  ribose-adenosine.  This  two- 
step  PCR  method  allowed  for  the  introduction  of  several  point  mutations  into  the 
catalytic  primer  region  without  necessitating  unique  DNA-RNA  chimeric  primers  for 
each  mutant. 

Preparation  of  Single-Stranded  DNAzymes 

Double  stranded  DNAzymes  generated  via  PCR  using  the  5'-phosphorylated  form 

of  the  complementary  strand  primer  ("complementary  +  5 '-phosphate)  were  converted  to 

single-stranded    DNA    by    digestion    of   the    complementary    strand    using    lambda 

exonuclease,  an  exonuclease  enzyme  specific  for  5'-phosphorylated  double  stranded 
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DNA.  Following  ethanol  precipitation  of  PCR  products,  DNA  pellets  were  resuspended 
in  25  uL  of  exonuclease  solution  (5  units  lambda  exonuclease  enzyme,  67  mM  glycine- 
KOH  (pH  9.4),  2.5  mM  MgCl2,  50  ug/mL  BSA)  per  100  uL  PCR.  Samples  were  mixed 
thoroughly  and  incubated  at  37°C  for  30  minutes.  Reactions  were  terminated  by  adding 
excess  formamide  stop  dye  (1  mg/mL  xylene  cyanol,  1  mg/mL  bromophenol  blue,  10 
mM  EDTA,  in  98%  formamide)  and  heating  at  80°C  for  10  minutes.  The  single-stranded 

products  were  loaded  on  an  8%  PAGE/urea  gel  and  full  length  ssDNA  products  excised 
using  a  sterile  razor  (thoroughly  washed  with  ethanol  between  each  sample).  Each  gel 
slice  was  crushed  and  eluted  in  350  uL  elution  buffer  (500  mM  NH4OAC,  0.1  mM 
EDTA,  0.1%  SDS)  overnight.  Gel  purified  samples  were  then  extracted  in 
phenol/chloroform/isoamyl  alcohol,  then  extracteded  choloroform/isoamyl  alcohol,  and 
precipitated  in  ammonium  acetate  and  ethanol. 

Asymmetric  PCR  was  also  used  to  generate  single  stranded  DNAzymes.  In  this 
procedure,  the  complementary  strand  primer  was  synthesized  with  a  10-20  nucleotide 
extension  (usually  poly-adenosine)  connected  to  the  5 '-end  of  the  primer  by  one  of 
several  linkers  (C9,  CI 8,  2'O-methyl-Uridine).  Vent  and  Taq  polymerase  cannot 
efficiently  read-through  these  linkers.  Therefore,  the  catalytic  strand  generated  by 
extension  of  the  catalytic  primer  terminates  at  the  linker,  generating  the  correct  size  for 
full-length  product.  Complementary  strand  molecules  generated  by  extension  of  a 
complementary  strand  primer  containing  the  "5'  tail"  are  longer  than  the  full-length 
product  by  the  length  of  the  tail,  and  are  easily  separated  by  gel  purification  on  an  8% 
PAGE/urea  gel.  Because  Vent  and  Taq  polymerase  occasionally  terminate  extension 
(about  50%  of  the  time)  when  encountering  a  caged-ribose  in  the  template,  PCR  using  a 
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caged-ribose  in  the  catalytic  primer  and  a  1 5  nucleotide  tail  on  the  complementary  primer 
generated  a  complementary  strand  that  was  either  121  nucleotides  (106  nucleotides  plus  a 
15  nucleotide  tail)  or  94  nucleotides  (106  nucleotides  plus  a  15  nucleotide  tail  minus  27- 
nucleotides  following  the  caged-ribose).  The  desired  catalytic  strand  was  easily 
identified  as  the  middle  band  and  cleanly  excised  on  an  8%  PAGE/urea  gel.  Gel  purified 
samples  were  crushed,  soaked  overnight  in  elution  buffer,  and  then  extracted  in 
phenol/chloroform/isoamyl  alcohol,  extracted  again  in  choloroform/isoamyl  alcohol,  and 
precipitated  in  ammonium  acetate  and  ethanol. 

5'-End  Labeling  of  DNA 
Single-stranded  DNA  (20  uM  )was  5 '-labeled  with  20  uCi  gamma-32P-ATP  using 
10  units  of  T4  polynucleotide  kinase,  70  mM  Tris-HCl  (pH  7.6),  10  mM  MgCl2,  5  mM 
dithiothreitol  in  a  final  volume  of  10  uL.  Reactions  were  incubated  for  30  minutes  at 
37°C  ,  followed  by  addition  of  equal  volume  TE,  and  finally  inactivated  by  incubating  for 
20  minutes  at  70°C.  End-labeled  DNA  was  separated  from  unincorporated  nucleotides  by 
spinning  through  a  G-25  column  at  600  g  for  3  minutes. 

DNAzyme  Kinetic  Assays 
Following  gel  purification  of  single-stranded  DNAzymes  and  substrates,  samples 
were  ethanol  precipitated  and  re-suspended  in  50  mM  HEPES  buffer.  Samples  were  then 
transferred  to  a  new  tube  and  the  concentration  estimated  by  measuring  specific 
radioactivity  following  a  measurement  of  by  Cherenkov  radiation.  Samples  were  then 
diluted  to  twice  the  desired  final  concentration  with  additional  50  mM  HEPES  buffer. 
For  trans  assays,  enzyme  and  substrates  were  typically  mixed  together  (unless  otherwise 
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noted),  and  diluted  to  twice  the  desired  final  concentration.  Samples  were  then  mixed 
with  equal  volume  2X  reaction  buffer  (typically  2M  NaCl,  2  mM  MgCl2,  50  mM  HEPES 
pH  7).  Reactions  were  then  heat  denatured  and  refolded  using  a  thermocycler  "slow 
cool"  protocol  (heating  to  96°C  for  3  minutes,  and  cooling  to  23°C  over  10  minutes).  For 
reactions  with  ribose-adenosine,  "time  zero"  was  considered  to  be  the  time  at  which  the 
sample  completed  the  "slow  cool."  For  reactions  with  a  caged-ribose,  the  reactions  were 
initiated  when  the  laser  de-protected  the  caged-ribose  (within  one  hour  of  the  completion 
of  the  "slow  cool").  Unless  otherwise  noted,  reactions  were  conducted  at  25°C 
(incubated  on  a  hot  block  set  to  25 °C  to  prevent  the  temperature  from  dropping  below 
25 °C).  Reactions  were  terminated  at  various  time  points  by  diluting  an  aliquot  of  the 
reaction  in  excess  formamide  stop  dye,  and  then  frozen  (-20°C).  Time  points  were  run  on 

an  8%  PAGE/urea  sequencing  gels  and  the  percent  cleaved  quantified  using  a  Bio-Rad 
phosphorimager. 

Cloning  and  Sequencing  DNAzvmes 
Prior  to  cloning,  single-stranded  DNAzymes  were  first  converted  to  double 
stranded  DNA  by  PCR  amplification,  usually  with  only  the  complementary  strand  primer 
("complementary  no  5 '-phosphate").  PCR  conditions  were:  96°C  for  3  minutes; 
followed  by  three  cycles  of  45  seconds  at  96°C,  four  minute  ramp  cool  to  50°C  plus  45 
seconds  at  50°C,  and  1  minute  at  72°C;  followed  by  7  minutes  at  72°C.  When  cloning 
cleaved  and  uncleaved  614,  the  desired  single-stranded  DNA  was  (usually)  first  isolated 
via  gel  purification.  PCR  amplifying  the  unpurified  614  reaction  mixture  containing 
both  cleaved  and  uncleaved  614  with  only  the  "complementary  no  5 '-phosphate"  primer 
yielded  only  uncleaved  molecule  clones;  cleaved  614  molecules  were  therefore  cloned  by 
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using  the  complementary  strand  primer  "complementary  no  5 '-phosphate"  with  the  5'- 
truncated  catalytic  primer  "3 '-cleaved  oligo."  Fresh  double  stranded  PCR  products  were 
cloned  using  the  TOPO  TA  Cloning  System  (Invitrogen)  and  plated  onto  Agarose  plates 
containing  ampicillin.  When  cloning  sequences  from  the  pool  of  survivors  from  the  "H" 
selection  condition,  transformed  cells  were  given  only  15  minutes  to  recover  in  antibiotic 
free  media  prior  to  plating  on  antibiotic  containing  plates.  This  prevents  recently 
transformed  clones  from  doubling  prior  to  plating  and  therefore  favors  sequencing  only 
unique  species  from  the  original  pool. 

DNA  was  prepared  from  individual  clones  using  a  modified  alkaline  lysis 
protocol.  Individual  clones  from  plates  were  used  to  inoculate  5  mL  of  TYGPN  media 
containing  freshly  added  ampicillin  and  grown  overnight  at  37°C  with  vigorous  shaking. 
Following  overnight  growth,  2  mL  of  cells  were  pelleted  by  centrifugation  for  1  minute  at 
16,000  g.  Growth  media  was  removed  and  the  cell  pellet  re-suspended  in  Solution  I  (300 
uL:   10  mM  Tris  pH  8.0,  1  mM  EDTA,  and  50  Ug/ml  Rnase  A).  Solution  II  (300  uL:   1 
mL  fresh  2  M  NaOH,  1  mL  10%  SDS,  8  mL  water)  was  then  added  and  the  samples  were 
then  inverted  gently  to  mix.  Solution  III  (300  uL:  5  M  potassium  acetate  pH  4.8)  was 
then  added,  followed  by  gentle  mixing.  The  sample  was  centrifuged  for  2  minutes  at 
16,000  g,  after  which  500  uL  of  chloroform  was  added  and  gently  mixed.  The  samples 
were  then  centrifuged  again  at  16,000  g  and  750  uL  of  the  aqueous  supernatant  was 
transferred  to  a  new  tube  containing  isopropanol  (750  uL).  This  sample  was  centrifuged 
for  6  minutes  at  16,000  g.  The  isopropanol  was  completely  removed,  and  the  remaining 
pellet  re-suspended  immediately  in  100  uL  water,  followed  by  the  addition  of  100  uL 
15%  PEG-8000.  The  re-suspended  pellet  was  stored  on  ice  for  30  minutes  (or  overnight), 
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after  which  the  sample  was  centrifuged  for  15  minutes  at  16,000  g.  The  supernatant  was 
removed  and  the  pellet  rinsed  with  70%  ethanol  (200  uL).  The  ethanol  was  removed  and 
the  pellet  was  dried  completely  under  vacuum.  The  resulting  DNA  pellet  was  re- 
suspended  in  20  uL  of  water.  This  protocol  routinely  yielded  100  ng/uL  of  DNA  of 
sufficient  purity  for  sequencing  reactions. 

Sequences  transformed  into  the  TOPO  TA  Cloning  Vector  were  sequenced 
using  3.2  pM  primer  "1224"  (5'-CGCCAGGGTTTTCCCAGTCACGAC)  with  300-500 
ng  plasmid  DNA  template,  2X  final  concentration  Big  Dye  reaction  buffer  and  2  uL  Big 
Dye  Terminator  Sequencing  mix  in  a  final  volume  of  10  uL.  Samples  were  overlayed 
with  mineral  oil  and  amplified  using  25  cycles  of  PCR  at  96°C  for  30  seconds,  50°C  for 
15  seconds,  and  60°C  for  4  minutes.  Unincorporated  nucleotides  were  removed  by 
spinning  9.5  uL  of  the  recovered  PCR  reaction  through  a  column  of  400  uL  G-25 
Sephadex  at  800  g  for  2  minutes.  The  recovered  sample  was  then  dried  to  completion,  re- 
suspended  in  17  uL  formamide  sequencing  buffer,  heated  to  95°C  for  2  minutes  and 
quickly  transferred  to  ice.  Sequencing  samples  were  analyzed  on  an  Applied  Biosystems 
Prism  310  Genetic  Analyzer.  Sequencing  results  were  confirmed  by  examining  the 
chromatograms  manually  using  the  "Sequencher"  software  package. 

In  Vitro  Selection 
DNAzyme  libraries  for  the  first  round  of  selection  were  prepared  by  a  single  cycle 
of  run-off  PCR  using  the  library  template  (5'-GTGCCAAGCTTACCGTCAC-N40- 
GAGATGTCGCCATCTCTTCC  (where  N  indicates  equal  molar  concentrations  of  A,  T, 
G,  and  C)  and  one  of  three  different  catalytic  strand  primers:  "ang  +  ribose,"  "BJ  + 
cage,"  or  "BJ  +  de-protected  cage"  (the  "BJ  +  de-protected  cage"  was  the  same  primer 
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as  "BJ  +  cage"  but  the  caged-ribose  had  been  de-protected  by  previous  exposure  to  300 
pulses  of  laser  (100  mJ/pulse)).  Each  primer  was  extended  using  either  the  standard 
dNTPs  (equimolar  T,  A,  G,  and  C)  or  with  a  non-standard,  5-position-functionalized 
nucleotide  triphosphate  (5-(3-aminoallyl)-2'-deoxyuridine  5 '-triphosphate,  referred  to  as 
E-base)  substituted  for  T-triphosphate  (equimolar  E,  A,  G,  C).  This  yielded  six  different 
IVS  starting  libraries, 


libary  #1 
libary  #2 
libary  #3 
libary  #4 
libary  #5 
libary  #6 


"BJ  +  cage"  primer,  standard  dNTPS; 

"ang  +  ribose"  primer,  standard  dNTPS; 

"BJ  +  cage"  primer,  AGC+E; 

"ang  +  ribose"  primer,  AGC+E; 

"BJ  +  de-protected  cage"  primer,  standard  dNTPS; 

"BJ  +  de-protected  cage"  primer,  AGC+E) 


Run-off  reactions  volumes  ranged  from  three  to  20  milliliters,  each  containing  1  ng/uL 
library  template,  100  nM  catalytic  strand  primer,  100  uM  dNTPs,  10  mM  KC1,  20  mM 
Tris-HCl  (pH  8.8),  10  mM  (NH4)2S04,  2  mM  Mg2S04,  0.1%  Triton  X-100,  20  units/mL 
Vent  exo-,  and  4.3  uCuries/mL  alpha-32P-CTP.  One  milliliter  of  each  sample  mixture 
(minus  polymerase)  was  aliquoted  into  1.5  mL  eppendorf  tubes,  heated  at  96°C  for  8 
minutes,  and  then  slowly  cooled  to  55°C  over  a  period  of  30  minutes.    Polymerase  was 

then  added,  and  the  samples  were  transferred  to  72°C  for  15  minutes.  The  samples  were 
then  ethanol  precipitated  with  ammonium  acetate  and  with  glycogen  as  a  carrier  by 
storing  overnight  at  -80°C  and  then  centrifuging  for  40  minutes  in  Corex  glass  tubes  at 

10,000  rpms  in  a  Beckman  centrifuge  at  4°C.  The  ethanol  was  removed  and  the  pellet 
was  re-suspended  in  200  uL  water.  The  samples  were  ethanol  precipitated  a  second  time 
in  with  ammonium  acetate  by  centrifuging  at  16,000  g  in  a  desktop  centrifuge  for  20 
minutes  at  4°C.  After  removing  the  ethanol,  the  pellet  was  resuspended  in  80  uL 
formamide  stop  dye  and  warmed  at  37°C  for  30  minutes  to  completely  dissolve  the  pellet. 
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The  samples  were  then  run  on  an  8%  PAGE-urea  gel,  and  the  full-length  products 
excised  using  a  sterile  razor  (rinsed  thoroughly  with  ethanol  between  samples).  Each  gel 
slice  was  crushed  and  eluted  in  350  uL  elution  buffer  (500  mM  NH4OAC,  0.1  mM 
EDTA,  0.1%  SDS)  overnight.  Samples  were  extracted  in  phenol/chloroform/isoamyl 
alcohol  (25:24:1),  then  extracted  in  chloroform: isoamyl  alcohol  (24:1),  and  precipitated 
in  ethanol  with  ammonium  acetate.  Pellets  were  resuspended  in  50  mM  HEPES  (pH  7) 
and  transferred  to  a  new  tube,  at  which  point  the  final  concentration  was  estimated  using 
specific  radioactivity.  All  samples  were  brought  up  to  327  nM,  except  library  #1  ("BJ  + 
cage"  primer  extended  using  standard  dNTPS  was  brought  up  to  3270  nM).  An  aliquot 
(30  uL)  of  libary  #1  was  set  aside  for  the  IVS  condition  "E"  (containing  ten  times  the 
starting  amount  of  library),  and  the  remaining  library  #1  was  diluted  to  327  nM.  An 
aliquot  (30  uL)  of  each  sample  was  then  mixed  with  equal  volume  of  the  appropriate  2X 
reaction  buffer  as  summarized  in  Table  2- 1 . 

The  samples  were  quickly  heated  to  96°C  for  3  minutes,  and  cooled  to  23°C  over 
10  minutes.  For  samples  without  a  caged-ribose  (G,  H,  N  and  O),  the  completion  of  the 
slow  cool  to  23  °C  was  considered  the  initiation  of  reaction.  All  samples  were  then 
exposed  to  150  pulses  of  laser  (90  mJ/pulse).  The  time  of  laser  exposure  was  considered 
the  time  of  reaction  initiation  for  all  samples  containing  a  caged-ribose.  Samples  were 
allowed  to  react  at  25°C  for  various  times  (20  to  720  minutes,  see  Table  2-1),  and  then 
stopped  by  adding  1.25  volume  of  formamide  stop  dye.  The  samples  were  then  run  on  an 
8%  PAGE/urea  gel  next  to  a  "cleaved  library"  marker.  The  area  corresponding  to  the 
size  of  the  cleaved  library  for  each  sample  was  excised,  crushed,  and  eluted  overnight 
(referred  to  as  "survivors  of  round  1"). 
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Table  2-1.  Selection  Conditions  for  Each  of  the  Fifteen  Parallel  IVS  Experiments. 


Final  Buffer: 

50  mM  HEPES 

Selection 

pH7;  IMNaCI; 

Time 

Sample 

Bases | 

AGC+) 

primer 

1  mM  of  metal  = 

(minutes) 

A 

T-base 

BJ  +  cage 

-0- 

120 

B 

T-base 
T-base 

BJ  +  cage 
BJ  +  cage 

Zn^ 

120 

C 

Mg++ 

20 

D 

T-base 

BJ  +  cage 
BJ  +  cage 
BJ  +  cage 

Mg^ 
Mg++ 
Mg^ 

120 
120 
720 

E 

T-base  - 

-  lOx  [library] 

F 

T-base 

G 

T-base 

BJ  +  de-protected  cage 

Mg" 

120 

H 

T-base 

ang  +  ribose 

Mg++ 

120 
120 

I 

E-base 

BJ  +  cage 

-0- 

J 

E-base 
E-base 

BJ  +  cage 
BJ  +  cage 

Zn++ 

120 

K 

Mg++ 

20 

L 

E-base 

BJ  +  cage 

Mg^ 

120 

M 

E-base 

BJ  +  cage 

Mg^ 

720 

N 

E-base 

BJ  +  de-protected  cage 

Mg^ 

120 

O 

E-base 

ang  +  ribose 

Mg^ 

120 

Following  elution,  samples  were  extracted  in  phenol/chloroform/isoamyl  alcohol, 
then  extracted  in  chloroform/isoamyl  alcohol,  and  mixed  with  ammonium  acetate  and 
ethanol.  Half  of  each  sample  was  stored  in  ethanol,  and  the  remaining  half  was 
precipitated  and  resuspended  in  50  uL  water.  For  subsequent  rounds  of  selection,  thirty 
microliters  of  the  survivors  of  the  previous  round  (in  water)  were  used  in  a  100  uL  PCR 
containing  400  nM  catalytic  strand  primer  (either  "ang  +ribose,"  "BJ  +  cage,"  or  "BJ  + 
de-protected  cage"),  100  nM  complementary  strand  primer  ("complementary  +  5'- 
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phosphate"),  100  uM  dNTPS  (either  AGCT  or  AGCE),  10  mM  KC1, 20  mM  Tris-HCl 
(pH  8.8),  10  mM  (NH4)2S04,  2  mM  MgS04,  0.1%  Triton  X-100,  and  3  units  Vent  exo- 
polymerase,  and  7.5  uCuries  alpha-32P-CTP.  PCR  amplification  consisted  of  3  minutes 
at  96°C,  followed  by  various  numbers  of  cycles  of  45  seconds  at  96°C,  45  seconds  at 
57°C,  and  2  minutes  at  72°C  (a  2  minute  extension  was  determined  to  be  sufficiently  long 
so  as  to  yield  approximately  equal  incorporation  efficiency  for  both  standard  and  non- 
standard nucleotides). 

Following  PCR,  samples  were  extracted  in  phenol/chloroform/isoamyl  alcohol, 
then  extracted  chloroform/isoamyl  alcohol,  and  precipitated  with  ammonium  acetate  and 
ethanol.  Samples  were  re-suspended  in  exonuc lease/buffer  solution  (25  uL  final  volume: 
5  units  lambda  exonuclease  enzyme,  67  mM  glycine-KOH  (pH  9.4),  2.5  mM  MgCl2,  50 
ug/mL  BSA)  and  incubated  at  37°C  for  30  minutes.     Reactions  were  terminated  by 
adding  60  uL  formamide  stop  dye  and  heating  at  80°C  for  10  minutes.  The  single- 
stranded  products  were  loaded  on  an  8%  PAGE/urea  gel  and  full  length  ssDNA  products 
excised.  Gel  purified  samples  were  then  extracted  in  phenol/chloroform/isoamyl  alcohol, 
then  extracted  chloroform/isoamyl  alcohol,  and  precipitated  in  ammonium  acetate  and 
ethanol.  Samples  were  re-suspended  in  50  mM  HEPES  buffer  (pH  7)  to  a  concentration 
of  25  -  30  nM.  Each  sample  (25  uL)  was  then  mixed  with  equal  volume  of  2X  reaction 
buffer  (see  Table  1  for  buffer  conditions  for  each  sample),  heated  and  slowly  cooled, 
exposed  to  laser  (150  pulses,  80-100  mJ/pulse),  and  stopped  with  formamide  stop  dye 
following  various  incubation  times  (see  Table  1  for  selection  time  used  for  each  sample). 
This  procedure  was  repeated  for  each  round  of  selection. 


CHAPTER  3 
RESULTS  OF  614  ANALYSIS 

Research  Objectives 

Understanding  why  added  functionality  does  not  appear  to  significantly  improve 
the  catalytic  power  of  nucleic  acid  enzymes  under  in  vitro  selection  (IVS)  schemes 
requires  knowledge  of  how  chemical  functionality  alters  the  distribution  of  catalytic 
potential  within  sequence  space.  This  in  turn  requires  that  an  experimental  model  system 
be  comprehensively  understood  so  that  it  can  be  modeled  mathematically  with  sufficient 
accuracy  to  support  quantitative  analysis.  Indeed,  this  depth  of  knowledge  is  important  to 
understanding  how  molecular  evolution  in  general  affects  phenotypic  evolution  in  an 
evolutionary  landscape  over  sequence  space  even  up  to  organismal  or  ecosystem  levels. 
The  success  of  previous  IVS  has  been  only  modest  (described  in  the  introduction),  and 
apparently  insufficient  to  explain  the  origins  of  life;  this  modest  success  of  previous  IVS 
may,  in  part,  be  attributable  to  several  details  of  the  IVS  experimental  system.  One 
strong  possibility  is  that  the  fastest  catalysts  are  lost  in  the  experimental  work-up. 

Our  aim  is  two  fold:  (a)  Examine  the  experimental  systems  in  sufficient  rigor  so 
as  to  allow  mathematical  modeling,  and  (b)  improve  the  selection  procedure  so  as  to 
minimize  the  potential  loss  of  catalysts  and  allow  quantitative  analysis.  To  develop  a 
system  that  supports  detailed  mathematical  modeling  of  DNAzyme  selections,  we  first 
examine  in  detail  a  model  DNAzyme  known  as  614  (isolated  from  a  previous  IVS).  To 
minimize  potential  loss  of  fast  catalysts,  we  explore  the  use  liquid  phase,  gel 
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electrophoresis-based  selection,  and  the  use  of  an  ortho-nitrobenzyl-protected  ribose 
("caged-ribose")  which  can  be  quickly  converted  to  the  unprotected  ribose  by  laser.  An 
IVS  is  performed  using  these  new  procedures.  Preliminary  modeling  of  the  experimental 
system  is  done  to  determine  how  various  parameters  of  the  experimental  system  have  an 
impact  on  the  mathematical  model,  and  how  the  experimental  system  can  be  fine-tuned  to 
improve  modeling  accuracy. 

Developing  Methods  for  Preparing  Single-stranded  DNAzymes  by 
Exonuclease  Degradation  of  5'-Phosphorylated  Complementary  Strand 

Column  purification  of  single-stranded  DNAzymes  present  numerous  problems, 
as  outlined  in  the  introduction.  The  ability  to  enzymatically  remove  the  complementary 
strand  was  tested  using  lambda  exonuclease.  Lambda  exonuclease  is  a  processive  5'  to 
3'  DNA  exonuclease  specific  for  5'-phosphorylated  DNA.  Double  stranded  614 
DNAzyme  was  generated  with  PCR  using  a  5'-phosphorylated  complementary  strand 
primer  ("complimentary  +  5'-P")  and  a  non-phosphorylated  catalytic  strand  primer 
(either  "ang  +  ribose"  or  "ang  +  cage").  Samples  were  internally  labeled  with  alpha-  P- 
CTP  to  allow  visualization  of  both  catalytic  and  complementary  strands.  Each  of  the  two 
samples  was  then  treated  with  or  without  exonuclease  and/or  strong  base  (which  cleaves 
unprotected  ribose)  and  run  on  a  denaturing  PAGE/urea  gel.  Figure  3-1  shows  the  results 
of  this  treatment  (all  figures  at  are  at  the  end  of  the  chapter). 

Both  untreated  ribose-  and  caged-614  double  stranded  PCR  products  show  two 
bands,  the  upper  corresponding  to  the  full  size  product,  and  the  lower  band  corresponding 
roughly  to  the  size  of  the  cleaved  product  (lane  1  and  5).  The  amount  of  the  lower  band 
is  significantly  greater  in  the  PCR  product  generated  using  the  caged-ribose  primer 
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(compare  lane  1  and  5).  This  suggests  that  the  lower  band  in  lane  5,  although  the  size  of 
the  catalytic  strand  cleaved  at  the  ribose,  is  not  actually  cleaved  product  from  the 
catalytic  strand.  This  band  is  actually  the  result  of  polymerase  termination  when  the 
complementary  strand  extension  reaches  the  caged-ribose  in  the  template.  This  is 
confirmed  by  the  observation  that  this  band  is  diminished  by  exonuclease  treatment 
specific  for  the  complementary  strand  (lane  7). 

When  the  ribose-614  double  stranded  PCR  product  is  treated  with  strong  base, 
there  is  a  significant  increase  in  the  lower  band,  corresponding  to  conversion  of  the  full- 
length  catalytic  strand  to  cleaved  product.  Comparing  lane  1  with  lane  2  shows  that 
approximately  half  of  the  ribose-614  double  stranded  PCR  product  is  converted  to 
cleaved  product  following  treatment  with  strong  base  (corresponding  to  the  cleavage  of 
the  catalytic  strand),  while  approximately  half  remains  full-length  (corresponding  to  the 
complementary  strand).  The  catalytic  strand,  approximately  half  of  originally  double 
stranded  ribose-614  PCR  product,  remains  full-length  following  exonuclease  treatment 
(lane  3),  although  there  is  no  increase  in  the  cleaved  product  (the  complementary  strand 
is  degraded  to  single  nucleotides).  The  full-length  band  remaining  following  treatment  of 
ribose-614  with  exonuclease  disappears  when  also  treated  with  strong  base  (lane  4).  This 
full-length  band  remains  for  caged-614  treated  with  exonuclease  and  strong  base  (lane  8), 
demonstrating  that  conversion  of  the  upper  band  to  cleavage  product  seen  in  lane  4 
corresponds  to  the  catalytic  strand  (there  is  slight  conversion  of  full-length  caged-614  to 
cleaved  product  under  strong  base,  possibly  the  result  of  partial  de-protection  under  room 
light).    The  absence  of  any  full-length  product  in  lane  4  demonstrates  that  nearly  the 
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entire  catalytic  strand  has  been  converted  to  the  cleaved  product  by  the  strong  base  and 
nearly  the  entire  complementary  strand  has  been  degraded  by  exonuclease. 

Asymmetric  PCR  with  "Tails."  Followed  by  Gel  Purification 

A  non-enzymatic  method  was  also  pursued  for  isolation  of  single-stranded 
DNAzymes.  In  this  procedure,  termed  "asymmetric  PCR,"  the  complementary  strand 
primer  was  appended  with  a  "tail"  consisting  of  10-20  nucleotides  connected  to  the  5' 
end  of  the  primer  via  a  linker.  Several  linkers  were  tested  for  there  ability  to  terminate 
polymerase  extension  when  encountered  on  the  template  complementary  strand.  If 
polymerase  extension  of  the  catalytic  strand  in  fact  terminates  when  it  encounters  a  linker 
on  the  complementary  strand,  all  catalytic  strands  would  be  the  normal  full-length. 
However,  all  molecules  generated  by  extension  of  the  complementary  strand  primer 
would  be  increased  in  effective  size  by  the  length  of  the  tail  appended  to  the  primer. 

Figure  3-2  shows  the  product  generated  by  PCR  amplification  of  a  106-nt 
fragment  using  5'-32P-labeled  primers  and  Taq  polymerase.  PCR  amplifications  were 
done  in  pairs:  odd-numbered  lanes  show  the  product  generated  using  the  5'-   P-labeled 
catalytic  strand  primer  ("ang  +  ribose")  with  unlabeled  complementary  strand  primer 
(designated  in  the  subsequent  lane),  while  even-numbered  lanes  show  the  product 
generated  with  unlabeled  catalytic  primer  and  5'-32P-labeled  complementary  strand 
primer.  All  products  generated  from  complementary  primers  with  tails  are  increased  in 
size  by  the  length  of  the  tail,  while  products  generated  from  the  catalytic  strand  primer 
are  of  normal  size  (106  nucleotides)  for  each  complementary  strand  primer  except 
"C18#l"  (lane  1  and  2).  This  demonstrates  that  Taq  polymerase  does  in  fact  terminate 
effectively  when  it  encounters  a  linker  in  the  template  for  all  linkers  tested  except 
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"C18#l"  (lane  1).  It  is  believed  that  the  complementary  primer  containing  the  "C18#l" 
linker  did  not  cause  termination  of  the  catalytic  strand  extension  due  to  incorrect 
synthesis  of  the  complementary  strand  primer  (the  same  linker  was  used  in  C18#2,  and 
this  successfully  terminated  extension). 

The  size  differential  between  catalytic  and  complementary  strands  generated  by 
asymmetric  PCR  was  then  used  to  isolate  the  two  strands  on  an  8%  PAGE/urea  gel. 
Figure  3-3  shows  the  result  of  PCR  in  which  both  strands  are  labeled  using  alpha-32P- 
CTP,  demonstrating  that  all  products  are  easily  resolved.  Lane  3  shows  the  full-length 
catalytic  strand  purified  using  exonuclease.  Lane  4  shows  two  bands  generated  via 
asymmetric  PCR,  the  106-nucleotide  band  corresponding  to  the  catalytic  strand  and  the 
121-nucleotide  band  corresponding  to  the  complementary  strand  plus  a  15-nucleotide  tail. 
Treatment  of  the  sample  in  lane  4  with  strong  base  converts  the  majority  of  the  106- 
nucleotide  band  to  the  79-nucleotide  cleavage  product.  A  small  amount  of  the  121- 
nucleotide  band  is  converted  to  the  94-nucleotide  band  by  strong  base 
treatment,  indicating  Taq  polymerase  can  occasionally  read  through  the  linker  of  the 
complementary  strand,  thus  generating  a  catalytic  strand  that  is  increased  in  size  by  the 
length  of  the  tail. 

As  noted  in  the  previous  section,  the  presence  of  the  caged-ribose  in  the  catalytic 
strand  occasionally  causes  strand  termination,  generating  a  complementary  strand  27- 
nucleotides  shorter  than  full-length  (the  caged-ribose  is  at  position  27-  in  the  catalytic 
strand).  Therefore,  asymmetric  PCR  using  a  complementary  strand  primer  with  a  15- 
nucleotide  tail  generates  three  bands,  a  106-nt  band  corresponding  to  the  normal  sized 
catalytic  strand,  a  121  -nt  band  corresponding  to  the  full-length  complementary  strand 
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plus  tail,  and  a  94-nt  band  resulting  from  termination  of  the  complementary  strand  plus 
tail  at  the  caged  ribose. 

DNAzyme  614  with  uncaged-  and  caged-ribose 
Achieving  maximal  deprotection  of  caged  ribose 

A  caged-ribose  was  synthesized  and  incorporated  into  the  catalytic  strand  primer, 
known  as  "ang  +  cage"  (an  identical  primer  was  synthesized  with  an  unprotected  ribose 
known  as  "ang  +  ribose".  Using  this  caged-ribose,  we  are  able  to  prevent  cleavage  at  the 
ribose  during  the  work  up  phase.  This  was  demonstrated  by  incubating  the  caged-ribose 
primer,  "ang  +  cage,"  in  0.5M  NaOH  at  80°C  for  one  hour,  resulting  in  only  6-10% 
conversion  to  the  cleaved  form  (via  base  hydrolysis);  the  same  conditions  for  the 
unprotected-ribose  primer,  "ang  +  ribose,"  resulted  in  94%  (90-97%)  cleavage.  The 
observation  that  about  6%  of  the  unprotected-ribose  primer  does  not  convert  to  the 
cleaved  form  under  base  hydrolysis  conditions  suggest  that  as  much  as  6%  of  the  "ang  + 
ribose"  primer  may  be  missing  the  ribose-adenosine  (due  to  imperfections  in  the  DNA 
synthesis  and  purification). 

To  test  the  ability  to  de-protect  the  caged-ribose,  "ang  +  cage"  was  exposed  to 
increasing  number  of  pulses  of  excimer  laser  (frequency  =  308  nM,  energy  =  30-50 
mJ/pulse)  to  induce  conversion  from  the  protected  to  unprotected  ribose.  The  efficiency 
of  conversion  to  unprotected  ribose  was  assessed  by  incubating  under  base  hydrolysis 
conditions  (Figure  3-4).  The  amount  of  caged-primer  cleaved  by  strong  base  increased 
with  increasing  laser,  reaching  a  plateau  of  80%  cleaved  with  100  pulses  of  laser.  Added 
laser  beyond  100  pulses  did  not  further  increase  the  amount  of  cleaved  caged-614. 
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The  amount  of  cleaved  "ang  +  cage"  (via  base  hydrolysis)  at  maximum  de-protection 
(100  pulses  of  laser)  was  consistently  about  10-15%  lower  than  the  cleavage  obtained  for 
"ang  +  ribose."  This  10-15%  reduction  in  cleavage  for  "ang  +  cage"  must  be  the  result  of 
either  two  factors:  incomplete  de-protection  or  missing  the  caged-ribose  adenosine. 
Although  cleavage  of  caged-614  plateaus  with  100  pulse  of  laser,  showing  no  further 
cleavage  even  with  1000  pulses  of  laser,  it  is  still  possible  that  some  caged-ribose 
molecules  have  not  been  hit  by  a  laser  photon. 

The  other  possible  explanation  for  reduced  cleavage  of  de-protected  caged-614  is 
the  absence  of  the  caged-ribose  adenosine,  again  as  a  result  of  imperfect  primer  synthesis 
and  purification.  As  much  as  6%  of  the  "ang  +  ribose"  primer  is  believed  to  be  missing 
the  ribose-adenosine  (above),  but  the  caged-adenosine  phosphoramidite  may  be 
incorporated  into  the  primer  more  poorly  than  the  ribose-adenosine  phosphoramidite, 
resulting  in  as  much  as  20%  of  the  "ang  +  ribose"  primer  missing  the  caged-adenosine. 
This  issue  is  revisited  in  a  following  section  "Cloning  and  Sequencing  614." 

Kinetic  profile  of  614  with  uncaged-  and  caged-ribose 

The  "ang  +  ribose"  and  "ang  +  cage"  primers  were  incorporated  into  614  via 
PCR.  The  catalytic  strand  of  the  PCR  products,  called  "ribose-614"  or  "caged-614," 
were  purified  first  by  exonuclease  degradation  of  the  complementary  strand  (which  was 
5'-phosphorylated),  followed  by  gel  purification  (8%  PAGE/urea).  The  ability  of 
"ribose-614"  and  caged-614  DNAzymes  to  self-cleave  in  reaction  buffer  (1  M  NaCl,  1 
mM  MgCl2,  50  mM  HEPES  pH  7)  was  tested.  Purified  caged-614  was  re-suspended  in 
reaction  buffer,  heated  to  97°C  for  3  minutes  and  cooled  to  23  °C  over  10  minutes 
(together  this  is  referred  to  as  slow  cool),  and  incubated  at  room  temperature  for 
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approximately  one  hour.  Cleavage  was  then  initiated  by  exposing  the  sample  to  varying 
pulses  of  laser  (0-1000  pulses).  Cleavage  of  "ribose-614"  was  initiated  by  resuspending 
in  reaction  buffer  and  slow  cooled.  Figure  3-5  shows  similar  progress  curves  for  both 
"ribose-614"  and  caged-614  cleavage.  Both  samples  show  identical  initial  cleavage 
kinetics,  with  cleavage  plateaus  near  50%.  The  caged-614  plateau  is  regularly  about  10- 
15%  lower  than  "ribose-614."  Several  scenarios  can  explain  the  lower  cleavage  plateau 
seen  with  caged-614:  (a)  insufficient  deprotection  of  caged-ribose,  (b)  the  614  sequence 
is  missing  the  ribose-adenosine  at  position  27  (discussed  above,  and  below  "Cloning  and 
Sequencing  614"),  or  (c)  laser  induced  damage  to  the  caged-614  DNA  sequence.  The 
reasons  for  both  caged-614  and  ribose-614  reaching  cleavage  plateau  far  below 
completion  is  explored  below  ("Understanding  Reasons  for  Incomplete  Cleavage  of 
DNAzyme  614). 

To  confirm  that  the  lower  cleavage  plateau  of  caged-614  was  not  due  to 
insufficient  laser,  kinetic  experiments  were  conducted  with  additional  pulses  of  laser.  In 
one  experiment,  caged-614  cleavage  was  initiated  with  150  pulses  of  laser  and  allowed  to 
react  for  73  hours,  approaching  cleavage  plateau.  The  sample  was  divided  into  two 
aliquots,  and  one  aliquot  of  the  sample  was  exposed  to  an  additional  150  pulses  of  laser. 
The  added  laser  caused  a  slight  increase  in  the  amount  cleaved,  but  this  increase  was 
immediate,  and  not  increasing  with  time,  suggesting  against  the  possibility  of  additional 
de-protection  of  caged-ribose. 

Another  experiment  was  also  performed  in  which  caged-614  cleavage  was 
initiated  with  varying  amounts  of  laser,  from  0  to  1000  pulses.  The  cleavage  plateau 
increased  with  increasing  amounts  of  laser  until  150  pulses  of  laser,  after  which 
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increasing  amounts  of  laser  did  not  cause  an  increase  in  cleavage  plateau  (Figure  3-6a 
and  7).  When  the  percent  of  caged-614  product  was  normalized  to  account  for  the 
incomplete  de-protection  at  low  levels  of  laser  (the  fraction  cleaved  at  each  time  point 
was  divided  by  the  fraction  deprotected  as  determined  by  base  hydrolysis),  the  kinetic 
profiles  for  caged-614  with  varying  amounts  of  laser  overlap  (except,  of  course,  zero 
laser,  which  is  not  expected  to  cleave  at  all).  This  shows  that  they  are  all  following  the 
same  kinetics  parameters,  with  the  plateau  varying  with  the  amount  of  laser  (with 
maximum  achieved  at  150  pulses  of  laser). 

The  caged-614  exposed  to  increasing  laser  was  also  incubated  under  base 
hydrolysis  conditions.  The  amount  of  caged-614  cleaved  by  strong  base  increased  with 
increasing  laser,  reaching  a  cleavage  plateau  with  150  pulses  of  laser.  Added  laser 
beyond  150  pulses  did  not  further  increase  the  amount  of  base-cleaved  caged-614  (Figure 
3-8).  This  directly  parallels  the  results  of  caged  primer,  "ang  +  cage,"  plus  laser  cleaved 
by  strong  base  condition,  indicating  that  maximum  de -protection  has  be  obtained. 

Laser  Does  Not  Damage  Ribose-614 

It  is  possible  that  caged-614  shows  a  lower  cleavage  plateau  than  ribose-614 
because  the  laser  (UV  frequency)  used  to  initiate  the  de-protection  of  the  caged-ribose 
causes  damage  to  the  DNAzyme.  Non-specific  breaking  of  the  phosphodiester  backbone 
is  expected  to  create  a  smear  running  from  1  to  106  nucleotides.  Although  slight 
smearing  was  noticed  between  the  cleaved  and  full  length  product,  there  was  no  smearing 
below  the  full  length  product  (79  nucleotides).  Furthermore,  this  smearing  was  not 
present  when  ribose-614  was  exposed  to  identical  laser  treatment. 
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The  laser  can  potentially  cause  DNA  damage  which  would  inactive  a  DNAzyme 
without  breaking  the  phosphodiester  backbone  (such  as  forming  thymine-thymine 
dimers).  We  were  unable  to  find  evidence  for  thymine-thymine  dimer  formation  when 
ribose-614  was  exposed  to  laser  (even  though  ribose-614  does  contain  two  successive 
thymines  at  positions  82  and  83,  and  100  and  101).  If  the  laser  is  causing  non-specific 
DNA  damage  resulting  in  a  reduced  cleavage  plateau,  this  should  be  evident  even  without 
the  caged-ribose.  To  test  this,  ribose-614  was  exposed  to  increasing  amounts  of  laser, 
from  0  to  300  pulses  (100  mJ/pulse).  No  change  was  noticed  in  cleavage  plateau  or 
initial  rates,  suggesting  laser  does  not  cause  damage  to  the  DNA  (Figure  3-9). 

DNAzyme  614  is  NaCl-dependent,  and  MgCb-independent 

DNAzyme  614  was  selected  using  the  standard  Breaker- Joyce  IVS  protocol  and 
using  a  reaction  buffer  of  1  M  NaCl,  1  mM  MgCl2,  and  50  mM  HEPES  pH  7.  This 
protocol  is  expected  to  create  ds-cleaving  enzymes  that  utilize  the  magnesium  as  a 
cofactor,  while  the  sodium  neutralizes  the  negative  charge  of  the  phosphodiester 
backbone  and  thus  stabilizes  folding.  The  Breaker- Joyce  IVS  protocol  assumes  that  the 
DNAzymes  generated  may  fold,  but  will  not  cleave  without  the  added  magnesium 
cofactor,  and  further,  that  the  rate  of  cleavage  will  increase  with  increasing  amounts  of 
magnesium  cofactor.  Figure  3-10  shows  that  there  is  little  difference  in  the  kinetics  of 
ribose-614  if  is  incubated  with  either  1  mM  or  3  mM  MgC^.  Experiments  in  which  the 
MgCh  was  completely  removed  and  3  mM  EDTA  was  added  also  showed  no  change  in 
cleavage  rates.  However,  removing  the  NaCl  from  the  reaction  buffer  completely 
prevents  catalysis  (Figure  3-11).  DNAzyme  614  clearly  is  not  magnesium-dependent,  as 
expected  under  the  Breaker- Joyce  experimental  set-up. 
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Supplementing  the  standard  reaction  buffer  with  2  M  ammonium  acetate  increases 
the  rate  of  cleavage.  Replacing  the  1  M  NaCl  of  the  reaction  buffer  with  1M  ammonium 
acetate  resulted  in  cleavage,  although  at  a  greatly  reduced  rate. 

Understanding  Reasons  for  Incomplete  Cleavage  of  DNAzyme  614 
Interpreting  and  modeling  the  results  of  an  IVS  requires  a  comprehensive 
understanding  of  the  mechanics  of  the  model  system.  The  previous  section  quantified  the 
experimental  parameters  of  using  a  caged-ribose  that  is  de-protected  by  laser.  Analysis 
of  caged  and  ribose-614  both  demonstrated  a  cleavage  plateau  far  below  "true 
completion"  (defined  as  cleavage  caused  by  strong  base,  which  is  -94%  for  "ang  + 
ribose"  and  -80%  for  "ang  +cage").  The  explanation  for  the  cleavage  plateau  must  be 
common  for  both  caged  and  ribose-614,  and  quite  likely  any  other  DNAzyme  generated 
via  this  IVS  procedure.  Indeed,  low  cleavage  plateau  is  an  issue  with  many  ribozymes 
and  DNAzymes,  but  the  specific  causes  for  this  reduced  plateau  is  often  ignored, 
presumably  because  the  researchers  either  do  not  realize,  or  do  not  believe  this 
phenomena  is  significant.  We  now  seek  to  explain  the  failure  for  the  reaction  to  reach 
"true  completion,"  knowing  that  such  information  is  crucial  to  both  redesigning  the  IVS 
protocol  and  creating  a  reasonable  model  of  the  IVS  experiment. 

Incomplete  De-protection  of  Caged-Ribose 

We  have  already  demonstrated  that  we  have  achieved  maximum  de-protection  of 
the  caged-ribose  with  100  pulses  of  laser  (meaning  additional  laser  does  not  result  in 
more  de-protection).  But  is  incomplete  de-protection,  even  at  maximum  de-protection,  a 
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plausible  explanation  for  the  failure  to  reach  true  completion?  This  possibility  is  ruled 
out  on  the  basis  that  ribose-614  also  reaches  a  plateau  that  is  far  below  true  completion. 
Although  the  maximal  cleavage  for  caged-614  is  consistently  10-15%  lower  than  that  of 
ribose-614  (both  when  self-cleaved  and  when  cleaved  by  base  hydrolysis),  both  reach 
cleavage  plateaus  that  are  as  much  as  35%  below  the  amount  of  cleavage  caused  by 
strong  base.  The  explanation  for  the  reduced  cleavage  plateau  must  be  common  for  both 
caged  and  ribose-614,  and  quite  likely  for  any  other  DNAzyme  generated  via  this  IVS 
procedure. 

Incomplete  Removal  of  the  Complementary  Strand 

Two  methods  were  used  for  removing  the  complementary  strand  generated  during 
the  PCR  synthesis  of  our  DNAzymes,  exonuclease  degradation  and  asymmetric  PCR. 
Asymmetric  PCR  has  nearly  no  potential  for  complementary  strand  impurity  since  the 
complementary  strand  is  15  nucleotides  longer  than  the  catalytic  strand,  and  thus  easily 
separated  from  the  catalytic  strand  during  gel  purification.  Experiments  in  which 
"ribose-614"  was  prepared  using  asymmetric  PCR  followed  by  gel-purification,  showed 
cleavage  plateaus  similar  to  ribose-614  generated  with  exonuclease,  suggesting  the 
cleavage  plateau  cannot  be  attributed  to  inhibition  by  contamination  of  the 
complementary  strand. 

When  "ribose-614"  generated  via  the  exonuclease  method  was  treated  with  strong 
base,  similar  amounts  of  conversion  to  the  cleaved  form  were  observed  as  seen  for  the 
ribose  primer  "ang  +  ribose"  (85-95%  for  "ribose-614"  vs.  90-97%  for  "ang  +  ribose). 
Since  there  is  no  possibility  of  a  contaminating  complementary  strand  in  the  "ang  + 
ribose"  primer,  the  -5%  difference  between  base  cleavage  of  the  "ang  +  ribose"  primer 
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and  the  full  length  DNAzyme  "ribose-614"  is  the  maximum  amount  attributable  to 
incomplete  degradation  of  the  complementary  strand  by  exonuclease. 

Exonuclease  efficiency  was  also  tested  by  5'-phosphorylating  a  single  primer  with 
gamma-32P-ATP.  This  radio-labeled  primer  was  then  used  to  synthesize  full  length 
"ribose-614."  The  double  stranded  product  was  then  divided  into  two  aliquots,  and  one 
was  subjected  to  standard  exonuclease  treatment.  Equal  amounts  of  each  sample  were 
then  electrophoresed  on  a  PAGE-urea  gel,  and  the  amount  of  full-length  product 
assessed.  Exonuclease  treated  samples  consistently  showed  less  than  10%  of  the  original 
material  remained  full  length. 

Experiments  in  which  small  amounts  of  the  614  complement  (10%  of  the  amount 
ribose-614)  were  added  to  ribose-614  had  no  impact  on  the  cleavage  plateau  (Figure 
3-12A  and  B).  When  cleavage  experiments  were  performed  with  equal  amounts  of 
ribose-614  and  its  complement,  however,  the  presence  of  the  complementary  strand 
dramatically  reduced  the  cleavage  plateau  to  nearly  30%  (Figure  3-12  C  and  D).  This 
demonstrates  the  importance  of  near-complete  removal  of  the  complementary  strand  to 
accurately  examine  DNAzyme  kinetics.  We  believe  both  asymmetric  PCR  and 
exonuclease  degradation  are  superior  methods  to  column  purification  under  strong  base 
for  removing  the  complementary  strand. 

Approach  to  Chemical  Equilibrium 

The  failure  of  a  substrate  to  completely  transform  itself  to  product  may  be  the 
result  of  an  approach  to  chemical  equilibrium.  In  this  case,  the  reverse  reaction  occurs 
fast  enough  to  convert  a  significant  fraction  of  the  product  back  into  substrate.  We  tested 
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to  see  if  this  was  the  case  in  our  cleavage  reaction  by  several  methods.  First,  cleavage  a 
200  nM  solution  of  caged-614  was  initiated  by  300  pulses  of  laser  and  allowed  to  react 
for  144  hours.  At  this  time,  near  cleavage  plateau,  the  reaction  was  split  into  three 
aliquots.  One  aliquot  was  unaltered.  The  second  aliquot  was  diluted  twenty-five  fold  in 
reaction  buffer.  Unlabeled  d-614  was  added  to  the  third  aliquot  (to  a  final  concentration 
of  200  nM).  D-614  is  the  same  sequence  as  ribose-614,  but  with  a  deoxy-adenosine  used 
in  the  place  of  the  ribose  adenosine.  In  a  later  section  we  show  that  d-614  acts  as  an 
enzyme  to  cleave  r-614  in  trans.  If  the  cleavage  plateau  is  a  result  of  equilibrium 
between  the  forward  reaction  and  reverse  reaction,  the  addition  of  additional  enzyme  (d- 
614)  should  shift  the  equilibrium  in  favor  the  formation  of  more  product  over  time. 
Similarly,  diluting  the  reaction  reduces  the  likelihood  that  an  enzyme  can  find  two 
substrates  to  ligate  together,  thus  shifting  the  equilibrium  in  favor  of  cleavage  (the 
cleavage  reaction  can  occur  in  cis,  and  thus  is  not  as  strongly  affected  by  dilution  as 
ligation).  Figure  3-13  shows  that  neither  treatment  significantly  alters  the  cleavage 
plateau,  arguing  against  equilibrium  between  the  forward  and  reverse  reaction. 

In  a  second  experiment,  the  79-nucleotide  cleavage  product  was  isolated  via  gel 
purification.  This  cleavage  product  was  radio-labeled,  and  following  purification,  was 
incubated  (in  excess)  with  full-length  ribose-614  (not  radio-labeled).  Although 
unlabelled,  previous  studies  have  shown  that  the  full-length  614  DNAzyme  begins  to 
cleave  itself,  generating  both  the  79  and  27-nucleotide  products.  If  the  reverse  reaction 
(ligation)  occurs,  the  excess  of  exogenous,  radiolabeled  79-nucleotide  fragment  ensures 
that  there  is  a  greater  chance  for  the  radio-labeled  79-nucleotide  fragment  to  become 
ligated  to  the  27-nucleotide  fragment,  thus  causing  an  increase  in  size  of  the  radio-labeled 
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79  nucleotide  fragment  over  time.  The  reaction  was  monitored  for  400  hours  for  a  shift 
in  size  of  the  79-nucleotide  product  to  full-length  (106  nucleotide),  but  no  such 
conversion  was  observed.  This  argues  against  the  possibility  that  a  significant  fraction  of 
614  cleavage  products  are  ligated  together. 

Are  the  27-  and  79-nucleotide  Fragments  Acting  as  Catalysts  or  Inhibitors? 

Two  products  are  created  by  the  cleavage  of  614  at  the  ribo-adenosine,  a  27-  and  a 
79-nucleotide  fragment.  The  standard  Breaker  and  Joyce  IVS  protocol  was  "designed"  to 
select  for  molecules  that  are  capable  of  self-cleavage  (in  cis),  and  cis  cleavage  is  assumed 
to  predominate.  However,  without  making  assumptions  about  the  structure  and 
mechanism  of  the  614  DNAzyme,  we  cannot  rule  out  the  possibility  that  either  of  the 
cleavage  products  continue  to  act  as  a  ribonuclease  DNAzyme,  or  as  a  ribonuclease 
DNAzyme  inhibitor.  We  therefore  set  out  to  test  this  possibility  using  our  model 
DNAzyme,  614. 

A  caged-614  cleavage  assay  was  set  up  as  normal  and,  following  extended 
incubation  in  reaction  buffer,  the  27-  and  79-nucleotide  products  were  gel  purified. 
These  products  were  then  added  in  trans  to  a  new  sample  of  full-length  caged-614,  as 
well  as  a  caged-library  sample  (identical  to  caged-614,  but  containing  a  randomized  40- 
nucleotide  region  between  the  catalytic  and  complementary  primers).  Following 
initiation  with  150  pulses  of  laser  and  incubation  in  reaction  buffer,  cleavage  of  the  new 
caged-614  sample  occurred  without  change,  even  in  the  presence  of  equal  concentration 
of  the  27-  and  79-nucleotide  fragments  (Figure  3-14).  This  suggests  against  the  idea  that 
the  27-  and  a  79-nucleotide  fragments  significantly  affect  614  cleavage  by  acting  as 
inhibitors  or  catalysts,  at  least  when  present  in  equal  concentration  (which  is 
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approximately  the  highest  concentration  they  reach  under  normal  614  cleavage  conditions 
in  which  approximately  half  of  614  converts  into  cleavage  products). 

The  caged-library  alone  did  not  show  detectable  catalysis  under  standard  reaction 
conditions;  this  follows  our  expectation  that  an  unselected,  random  library  will  have  very 
few  active  catalysts.  Addition  of  the  27-nucleotide  product  alone  did  not  increase  caged- 
library  cleavage,  while  addition  of  the  79-nucleotide  product  alone  did  slightly  increase 
cleavage  of  the  library.  Cleavage  levels  were  extremely  low,  however:  while  614  had 
reached  a  cleavage  plateau  of  65%  in  about  50  hours,  the  library  cleaved  by  the  79- 
nucleotide  fragment  had  reached  only  15%  cleavage  in  300  hours.  The  addition  of  the 
27-nucleotide  fragment  together  with  the  79-nucleotide  fragment  reduced  cleavage  of  the 
library. 

This  shows  that  the  79  nucleotide  fragment  can  act  as  a  DNAzyme  to  cleave  a 
random  library  substrate  in  trans,  but  the  rate  of  cleavage  is  orders  of  magnitude  slower 
that  the  full-length  614.  The  27-nucleotide  fragment  reduces  this  already  low  activity  of 
the  79-nucleotide  fragment,  suggesting  it  acts  to  inhibit  cleavage  by  competing  with  the 
substrate  (ribose-library)  for  binding  to  the  79-nucleotide  fragment.  These  results  are 
considered  again  in  a  later  section  when  we  explore  potential  structures  of  614. 

Improperly  Folded  DNAzyme  614 

Having  shown  that  the  cleavage  products  do  not  significantly  alter  the  cleavage 
plateau,  we  then  set  out  to  determine  whether  the  plateau  was  due  to  the  mis-folding  of  a 
fraction  our  DNAzyme  into  an  inactive  conformation.  While  most  IVS  protocols  largely 
ignore  the  folding  issue  altogether  (assuming  folding  happens  quickly  and  efficiently,  in 
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respect  to  cleavage,  by  re-suspending  the  DNAzyme  in  reaction  buffer  without 
cofactors),  we  included  a  re-folding  step  that  is  easily  reproducible.  Following  re- 
suspending  the  DNAzyme  in  reaction  buffer,  samples  were  heated  to  97°C  for  three 
minutes  to  achieve  complete  denaturation.  The  samples  were  then  cooled  to  23 °C  over 
10  minutes  using  a  thermocycler.  Although  very  consistent,  even  this  folding  procedure 
cannot  guarantee  that  all  DNAzymes  find  the  same  "active"  conformation.  Indeed, 
structure  predictions  of  614  show  several  potential  structures,  perhaps  only  one  of  which 
is  active.  Folding  into  inactive  structures  can  potentially  contribute  to  the  low  cleavage 
plateau. 

When  614  cleavage  assays  are  stopped  by  freezing  to  -80°C  and  run  on  a  non- 
denaturing  PAGE  gel,  two  bands  are  present  at  initial  time  points.  The  first  (highest) 
band  represents  -15%  of  the  sample,  and  remains  unchanged  throughout  the  duration  of 
the  kinetic  assay.  The  second  (middle)  band  is  initially  85%  of  the  sample,  but  with  time, 
is  converted  to  a  third  (lowest)  band  (at  200  hours,  the  first  band  remains  15%  of  the  total 
sample,  while  the  second  band  is  reduced  to  -10%,  and  the  third  band  is  -75%).  This 
result  was  observed  when  r-614  was  purified  via  asymmetric  PCR  or  by  exonuclease 
degradation,  suggesting  that  the  unchanged  upper  band  likely  represents  an  alternative 
conformation  of  the  catalytic  strand  rather  than  contaminating  complementary  strand. 

We  set  out  to  test  this  possibility  with  two  methods.  First,  ribose-614  was 
incubated  under  standard  reaction  conditions  for  140  hours,  allowing  the  reaction  to 
approach  the  cleavage  plateau.  An  aliquot  of  this  sample  was  removed  and  denatured  by 
adding  2  times  the  volume  loading  dye  (95%  formamide,  EDTA,  xylene  cyanol, 
bromophenol  blue)  and  heating  at  90°C  for  2  minutes.  The  sample  was  then 
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electrophoresed  on  a  denaturing  PAGE/urea  gel,  and  the  uncleaved  ribose-614  was 
excised  and  purified.  This  previously  uncleaved,  gel-purified  ribose-614  was  ethanol 
precipitated,  resuspended  in  reaction  buffer,  and  folded  using  the  slow  cool  protocol. 

Cleavage  progress  following  this  gel-purification  was  then  compared  to  an 
untreated  sample.  Twenty-five  percent  of  the  gel-purified,  ribose-614  sample  was 
cleaved  in  the  300  hours  following  gel-purification  (Figure  3-15).  The  additional 
cleavage  following  the  gel-purification  suggests  that  some  of  the  uncleaved  sample  is 
locked  in  an  inactive  structure.  It  is  notable,  however,  that  the  gel-purified  sample 
reached  a  cleavage  plateau  of  approximately  25%,  far  lower  than  the  cleavage  plateau  of 
the  original  sample  (70%). 

While  part  of  this  incomplete  cleavage  may  again  be  attributed  to  mis-folded 
molecules,  if  mis-folding  were  the  only  cause  of  incomplete  cleavage,  a  plateau  of 
approximately  70%  would  be  expected  for  the  gel-purified  sample.  The  fact  that  the  gel- 
purified  sample  reaches  a  cleavage  plateau  far  below  70%  indicates  that,  while  mis- 
folding  may  account  for  as  much  as  25%  of  the  uncleaved  fraction,  other  reasons  must 
account  for  the  majority  of  the  uncleaved  fraction.  Gel-purification  allows  mis-folded 
molecules  the  opportunity  to  refold  and  cleave,  but  it  results  in  enrichment  for  molecules 
that  are  unable  to  cleave  for  reasons  other  than  incorrect  folding,  such  as  missing  the 
ribose-adenosine,  or  other  mutations. 

One  can  argue  that  gel-purification  increases  the  cleavage  of  ribose-614  by 
removing  some  inhibitory  factor  rather  than  allowing  for  proper  refolding.  Although  the 
experiments  described  in  previous  sections  demonstrated  no  significant  inhibitory  effect 
of  the  cleavage  products  on  614  cleavage,  we  tested  the  "mis-folding"  possibility  without 
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removing  any  potential  inhibitors.  We  did  this  by  again  allowing  614  to  reach  cleavage 
plateau,  and  then  denaturing  an  aliquot  of  the  sample  by  heating  it  to  97°C  for  3  minutes 
and  slowly  cooling  to  23°C  over  10  minutes.  Reheating  and  slowly  cooling  allows  any 
molecule  "stuck"  in  an  inactive  conformation  a  second  chance  to  adopt  an  active 
conformation.  As  in  the  gel-purification  experiments,  denaturing  and  refolding  the 
sample  via  the  slow  cool  procedure  increases  the  amount  of  material  cleaved  (Figure 
3-16).  The  increase  in  cleavage  is  significant  (usually  about  10%),  but  does  not  bring 
cleavage  up  to  94%  (the  amount  cleaved  by  strong  base  and  therefore  considered 
"completion")  and  therefore  cannot  account  entirely  for  the  uncleaved  614  at  plateau. 

Cloning  and  Sequencing  Cleaved  and  Uncleaved  614  Near  Cleavage  Plateau 

The  ability  of  strong  base  to  cleave  the  ribose-primer,  "ang  +  ribose,"  to  only  90- 
97%  suggested  that  about  6%  of  the  primer,  and  therefore  any  full  length  DNAzyme 
made  from  that  primer,  may  be  missing  the  ribose-adenosine.  For  the  caged-primer,  "ang 
+  cage,"  base  hydrolysis  following  laser  deprotection  resulted  in  only  80%  cleavage, 
suggesting  that  the  problem  of  missing  ribose-adenosine  may  be  even  more  severe  for  the 
caged-primer  (this  could  also  be  due  to  incomplete  deprotection,  despite  excess  laser). 
Mutations  may  also  occur  throughout  the  DNAzyme,  either  as  a  result  of  mutations 
introduced  during  the  synthesis  of  the  primers,  or  mutations  between  the  primers  as  a 
result  of  polymerase  error.  Together,  these  mutations  may  reduce  or  eliminate  the 
catalytic  power  of  a  fraction  of  the  DNAzyme  pool.  This  possibility  was  examined  by 
cloning  and  sequencing  both  cleaved  and  uncleaved  614  near  the  cleavage  plateau 
(Figure  3-17).  Of  the  28  sequenced  clones  of  614  that  were  cleaved  at  the  time  of  gel- 
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purification,  only  three  (1 1%)  were  found  to  contain  a  mutation  (all  three  were  in  the  N40 
region  between  the  primers). 

Of  the  65  clones  of  614  that  were  un-cleaved,  44  had  at  least  one  mutation  (68%). 
Twelve  clones  (18%  of  those  sequenced)  had  a  mutation  in  the  N40  region;  thirty-one 
clones  (46%)  had  a  mutation  in  the  "ang"  primer;  three  clones  (5%)  had  a  mutation  in  the 
complementary  primer.  Of  the  uncleaved-614  clones  sequenced,  nine  were  from  a  614 
DNAzyme  that  originally  contained  a  caged-ribose  (cleavage  initiated  by  laser).  Of  these 
nine  clones,  five  (56%)  were  shown  to  be  missing  the  ribose-adenosine,  as  opposed  to 
seven  of  56  clones  (13%)  isolated  from  uncleaved  ribose-614.  This  confirms  our  original 
suspicion  that  a  higher  proportion  of  the  "ang  +  cage"  primer  is  missing  the  ribose- 
adenosine  as  compared  to  the  "ang  +  ribose"  primer,  and  explains  both  the  observation 
that  about  15%  less  of  the  "ang  +  cage"  primer  with  maximum  de-protection  is  cleaved 
by  strong  base  than  the  "ang  +  ribose"  primer,  and  the  observation  that  caged-614  reaches 
a  cleavage  plateau  about  10%  lower  than  ribose-614. 

A  caveat  must  be  added,  however,  in  that  it  has  been  observed  that  the  polymerase 
tends  to  terminate  primer  elongation  about  50%  of  the  time  when  it  encounters  the  caged- 
ribose-adenosine  on  the  template  strand.  Given  this,  it  is  possible  that  the  polymerase 
skips  over  the  caged-ribose-adenosine  in  the  fraction  that  does  not  get  terminated  at  the 
caged-ribose,  and  thus  the  full-length  double  stranded  DNA  that  is  required  for  cloning  is 
missing  the  adenosine  as  a  result  of  the  polymerase  skipping  the  caged-ribose.  For  this  to 
occur,  the  caged-ribose  must  not  have  been  fully  de-protected  by  the  initial  laser 
treatment  (polymerase  does  not  terminate  appreciably  at  an  uncaged-ribose).  Together, 
this  confirms  that  the  lower  cleavage  seen  for  caged-614,  as  compared  to  ribose-614,  is  a 
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result  of  only  two  possibilities,  incomplete  de -protection  or  a  higher  fraction  of  the 
primer  missing  the  ribose-adenosine. 

Several  of  the  614  clones  with  mutations  were  tested  to  see  if  their  mutations  in 
fact  reduced  the  rate  of  cleavage.  Figures  3-18  and  19  show  the  cleavage  profiles  of 
several  mutants  and  demonstrate  that  nearly  all  of  the  mutations  cause  greatly  reduced 
cleavage  rates.  Together  with  the  observations  above,  we  have  shown  that  approximately 
6%  of  the  total  ribose-614  sample  does  not  cleave  because  it  is  missing  the  ribose- 
adenosine  (20%  for  the  caged-614),  approximately  10-15%  of  the  total  sample  may  be 
mis-folded  and  only  slowly  converts  into  the  active  conformation,  and  68%  of  the 
uncleaved  fraction  may  not  cleave  due  to  mutations. 

DNAzyme  614  Cleaves  in  cis  and  trans 
As  is  the  case  with  all  enzymes,  a  DNAzyme  reaction  can  be  thought  of  as 
occurring  in  several  steps:  binding  the  substrate,  the  chemical  step  (where  covalent  bonds 
are  made  or  broken),  and  release  of  the  substrate.  The  IVS  protocol  was  designed  so  that 
the  DNAzyme  contains  the  substrate  (ribose-adenosine)  linked  to  the  enzyme  (the 
random  N40  region),  and  engineered  such  that  the  DNAzyme  would  fold  back  onto  itself 
via  two  clamps  of  base  pairing,  one  on  each  side  of  the  ribose-adenosine,  thus  bringing 
the  "enzymatic  region"  in  proximity  to  the  "substrate."  Because  of  the  design,  the 
DNAzyme  is  expected  to  fold  more  rapidly  with  itself  than  bind  other  molecules  at 
typical  concentrations,  and  therefore  the  reaction  is  assumed  be  unimolecular.  However, 
because  all  molecules  in  a  library  share  the  same  5'-  and  3 '-primers,  over  60%  of  the 
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sequence  is  identical  in  all  molecules  (in  particular,  the  regions  surrounding  the  ribose- 
adenosine). 

This  opens  the  possibility  that  the  "enzymatic  region"  may  bind  to  the  "substrate 
region"  of  another  molecule  and  either  cleave  in  trans  or  inhibit  cleavage.  In  the  standard 
Breaker- Joyce  protocol  (which  was  used  to  generate  614),  binding  potential  DNAzymes 
to  a  column  is  believed  to  prevent  molecules  from  binding  to  and  cleaving  other 
molecules.  Amplification  and  reselection  of  the  survivors  is  also  believed  to 
preferentially  enrich  self-cleaving  DNAzymes.  It  is  possible,  however,  that  an 
"enzymatic"  DNAzyme  can  bind  to  a  "substrate"  molecule,  and  together  they  bind  to  the 
column  via  the  biotin  of  one  of  the  two  molecules.  This  conjoined  DNAzyme/substrate 
complex  survives  washing,  but  upon  cleaving  of  the  substrate,  either  both  enzyme  and 
substrate  are  released  from  the  column  (if  the  conjoined  molecule  was  attached  to  the 
column  via  the  biotin  of  the  substrate),  or  only  the  "substrate"  is  washed  off  (if  the 
conjoined  molecule  was  attached  to  the  column  via  the  biotin  of  the  enzyme).  This  may 
result  in  the  inadvertent  selection  for  DNAzymes  that  can  cleave  in  trans,  as  well  as 
molecules  that  are  good  substrates  for  /ra«s-cleavage. 

The  "ang"  primer  used  to  generate  DNAzyme  614  (as  well  as  kinetic  analysis)  is 
nearly  identical  to  the  Breaker- Joyce  primer,  but  with  two  single-nucleotide  changes  that 
reduce  the  strength  of  the  base-pairing  clamps  designed  to  encourage  cis  folding.  These 
changes  further  open  up  the  possibility  that  cleavage  can  occur  in  trans.  We  are  unaware 
of  any  published  results  that  explicitly  consider  the  impact  of  binding  clamps  on  the 
outcome  of  selections,  but  based  on  the  assumption  that  the  protocol  selects  for  cis- 
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cleaving  activity  of  the  random  region,  most  research  apparently  assume  the  impact  of 
these  engineered  sights  is  negligible. 

However,  some  published  accounts  have  reported  that  the  rate  of  some 
DNAzymes  appears  to  decrease  when  they  are  generated  with  additional  cycles  of  PCR 
(5  vs.  20  cycles  -  too  few  cycles  to  expect  a  significant  change  in  the  amount  of  mutation 
for  a  40-nucleotide  product)  (Geyer,  1997:  Breaker,  1995).  No  explanation  is  offered  for 
this  result.  Neither  papers  cited  do  not  report  the  concentration  of  DNAzyme  used  in 
their  kinetic  analysis,  presumably  because  they  never  considered  the  possibility  of  trans- 
cleavage  (and  it  is  therefore  likely  that  the  DNAzyme  concentration  was  not  controlled  in 
their  analysis).  If  the  concentration  is  not  directly  controlled,  additional  cycles  of  PCR 
will  obviously  result  in  additional  DNAzyme;  it  is  plausible  that  the  cleavage  rates 
changed  with  additional  cycles  of  PCR  because  the  DNAzyme  can  act  in  trans  and 
therefore  the  rate  is  concentration  dependent.  This  next  section  explicitly  explores  this 
possibility. 

Cleavage  Rate  of  Ribose-614  Varies  with  614  Concentration 

If  a  DNAzyme  exclusively  self-cleaves,  it  generates  product  exclusively  by  a  first 
order  process,  and  the  apparent  first  order  rate  constant  should  be  independent  of 
[DNAzyme].  A  plot  of  apparent  kobs  versus  [DNAzyme]  should  be  a  horizontal  line.  If 
product  were  generated  exclusively  by  a  second  order  process,  then  the  apparent  first 
order  rate  constant  would  be  dependent  upon  [DNAzyme].  A  plot  of  apparent  kobs  versus 
[DNAzyme]  should  be  a  line  that  includes  the  (0,0)  origin,  with  a  slope  equal  to  the 
second  order  rate  constant.  If  product  were  generated  exclusively  by  a  second  order 
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process  with  saturation  (implying  that,  for  the  DNAzyme  614,  one  molecule  of  614  must 
bind  to  another  molecule  of  614  for  a  reaction  to  occur,  but  that  the  614-614  complex 
could  disassociate  before  a  reaction  occurred,  a  condition  well  known  in  Michaelis- 
Menten  kinetics),  then  the  apparent  first  order  rate  constant  would  be  dependent  upon 
[614]  initially,  and  approach  an  asymptote  at  high  [614].  Here,  a  plot  of  apparent  kobs 
versus  [614]  should  be  a  line  that  includes  the  (0,0)  origin,  sloping  up  and  ultimately 
leveling  to  a  plateau. 

The  concentration  dependence  of  the  rate  of  cleavage  of  ribose-614  was  tested  by 
examining  the  progress  curve  and  the  initial  rates  for  ribose-614  cleavage  at  various 
different  dilutions  (430  nM  to  0.3  nM  ribose-614).  Results  from  this  experiment  (Figure 
3-20,  21,  and  22)  show  a  reality  where  several  rate  processes  are  operating  at  the  same 
time. 

Figure  3-21  shows  that  the  initial  rate  of  ribose-614  cleavage  decreases  with 
decreasing  concentration  (a  logarithmic  abscissa  is  used  to  capture  the  full  range  of  the 
experimental  concentrations).  The  decrease  in  the  initial  rate  of  cleavage  with  decreasing 
concentration  reaches  a  lower  limit  below  which  further  dilution  causes  very  little  further 
reduction  in  the  initial  rate.  This  implies  that  the  rate  does  have  some  concentration 
dependence  and  implies  that  substrate  is  being  consumed  by  two  paths:  a  first  order  rate 
process  and,  at  least  partly,  by  a  process  that  is  higher  than  first  order.  In  the  left  portion 
of  Figure  3-21  (low  [614]),  the  first  order  process  dominates  the  overall  rate.  In  the  right 
portion  of  Figure  3-21  (high  [614]),  second  order  processes  dominate  the  overall  rate. 
This  experiment  was  repeated  several  times  over  with  similar  results. 
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In  Figure  3-22A  (the  same  data  as  Figure  3-21,  but  with  a  linear  abscissa),  the 
intercept  at  [614]  =  0  is  not  at  the  origin.  This  requires  that  at  least  some  of  the  rate  arise 
via  a  unimolecular  process.  Figure  3-22B  shows  a  best-fit  line  extrapolated  to  zero 
(infinite  dilution)  for  614  concentrations  below  3.5  nM  (predominately  unimolecular) 
yields  an  intercept  corresponding  to  the  unimolecular  rate  constant  0.0059  hr"  (half-life 
is  1 15  hours).  The  slope  of  this  line  is  8.9xl05  M'hr"1,  which  for  a  second  order  process 
when  [S]«Km,  equals  the  second  order  rate  constant  (kcat/Km). 

A  plot  of  the  natural  log  of  uncleaved  ribose-614  (substrate)  versus  time  should  be 
linear  for  a  first  order  process;  a  linear  relationship  fit  the  data  well  for  low  ribose-614, 
but  not  for  higher  concentrations  (Figure  3-23).  The  negative  of  the  slope  of  the  best-fit 
line  for  0.3  nM  ribose-614  is  0.0042x1 0*3  hr"',  which  is  in  close  agreement  with  the  rate 
estimated  by  extrapolation  of  the  initial  rates  of  ribose-614  to  infinite  dilution  (Figure  3- 
22A  and  B). 

Further,  it  appears  from  Figure  3 -22 A  that  a  plateau  has  not  been  reached.  It  may 
be  that  the  [614]  is  simply  not  high  enough  in  this  experiment.  However,  this  plot  does 
not  rule  out  the  possibility  that  a  purely  bimolecular  process  (not  involving  saturation)  is 
contributing  to  the  overall  rate  of  the  reaction.  Although  this  rate  is  low  by  normal 
ribozyme  standards,  and  even  the  standard  of  some  DNAzyme  phosphodiesterases 
isolated  via  IVS,  this  unimolecular  rate  is  comparable  to  the  magnesium-independent 
catalyst  isolated  using  IVS  by  Geyer  (0.012  hour"1  for  "G3  Family"  DNAzyme  before 
optimization). 

We  then  tested  to  see  if  the  progress  curve  for  each  concentration  of  ribose-614  fit 
best  to  one  exponential  (in  which  both  the  rate  and  plateau  can  vary)  or  the  sum  of  two 
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exponential  equations  (in  which  both  the  rate  and  plateau  can  vary  for  each  equation).  At 
low  ribose-614  concentrations  (3.5  nM  and  below),  the  progress  curves  for  cleavage  fit 
well  to  a  single  exponential  equation,  whereas  at  higher  concentrations  the  progress 
curves  were  fit  better  by  the  sum  of  two  exponential  equations.  A  single  exponential 
curve  fit  to  progress  curve  for  0.3  nM  ribose-614  cleavage  estimated  a  rate  0.01 1x10" 
hr"1  (with  a  predicted  plateau  of  61%).  This  agrees  with  the  other  estimations  of  the 
unimolecular  rate  made  above  (which,  comparatively,  are  underestimations  since  they 
have  not  corrected  for  a  plateau  below  100%). 

This  suggests  that  at  low  concentrations  (3.5  nM  and  below),  the  reaction  behaves 
in  unimolecular  fashion  while  at  higher  concentrations  the  reaction  is  not  well 
characterized  by  a  single  unimolecular  rate  law.  Furthermore,  we  can  conclude  that,  at 
low  concentrations,  where  unimolecular  processes  dominate,  a  second  conformer  of  en- 
folded 614  with  lower  reactivity  is  not  present  at  significant  levels.  In  an  earlier  section 
("Improperly  Folded  DNAzyme  614"),  we  already  considered  variants  of  614  that  were 
inactive  (mutants,  for  example). 

This  is  important  because  if  these  DNAzymes  in  general  have  a  single  active 
conformation,  and  react  following  a  single  exponential  first  order  rate  law,  then  the 
distribution  of  catalytic  power  within  a  population  of  related  molecules  can  be  modeled 
(at  least  as  an  approximation)  by  a  transform  that  fits  a  spectrum  of  first  order 
exponential  processes  to  a  progress  curve. 

The  "non-unimolecular"  component  of  ribose-614  cleavage  is  not  purely  bi- 
molecular  as  tripling  the  concentration  of  ribose-614  does  not  increase  the  initial  rate 
nine-fold  as  would  be  expected  (but  in  fact  it  increases  less  than  two-fold).  Furthermore, 


61 


62 

if  the  reaction  where  simply  the  sum  of  a  unimolecular  and  bimolecular  reaction,  the 
progress  curve  should  fit  to  the  sum  of  two  power  functions  (Y  =  AX  +  BX  ,  where  X  is 
the  concentration  of  614,  Y  is  the  initial  rate  (kobs),  A  is  the  first  order  rate  constant  and  B 
is  the  second  order  rate  constant).  This  model,  however,  was  unable  to  provide  a 
reasonable  fit  for  the  experimental  results,  suggesting  that  ribose-614  cleavage  is  not 
simply  sum  of  a  unimolecular  and  a  bimolecular  reaction. 

Deoxvribose-614  Cleaves  Various  Ribose-Substrates 

The  observation  that  ribose-614  can  cleave  in  a  non-uni-molecular  fashion 
suggests  trans  cleavage.  This  was  tested  directly  by  incubating  614  with  various 
substrates,  each  containing  the  5'  catalytic  motif  and  ribose-adenosine.  Initially,  100  nM 
ribose-614  was  incubated  with  100  nM  of  5'-32P-labelled  "ang  +  ribose"  primer.  Figure 
3-24  shows  that  both  ribose-614  and  "ang  +  ribose"  are  cleaved,  clearly  establishing  trans 
cleavage  ability  of  614.  The  cleavage  of  ribose-614  in  the  presence  of  "ang  +  ribose" 
primer  was  lower  than  ribose-614  incubated  alone,  suggesting  that  the  "ang  +  ribose" 
primer  competes  with  ribose-614  for  cleavage  by  ribose-614  (Figure  3-25). 

We  then  tested  whether  a  614  DNAzyme  synthesized  with  a  deoxy-adenosine 
instead  of  a  ribose-adenosine  (referred  to  as  "d-614),  and  therefore  unable  to  cleave  itself, 
could  cleave  various  substrates  in  trans.  We  again  tested  trans  cleavage  of  the  "ang  + 
ribose"  primer  (Figure  3-24),  as  well  as  a  mutant  of  614  (known  as  "4up6,"  which  has 
greatly  reduced  catalytic  activity),  a  pool  of  molecules  synthesized  from  the  random 
library,  and  a  individual  clone  from  the  random  library  (known  as  "Hb61o2,"  which  has 
no  detectable  catalytic  activity).  Figure  3-26  shows  that  each  substrate  was  cleaved  by  d- 
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614,  although  with  varying  degrees  of  efficiency.  The  observation  that  the  single- 
nucleotide  point  mutation  of  614,  "4up6"  (which  is  likely  to  fold  similarly  to  614),  is 
cleaved  the  slowest  by  d-614,  while  the  "ang  +  ribose"  primer  is  cleaved  faster  than  other 
full  length  molecules,  agrees  with  the  prediction  that  non-productive  binding  with  "self 
can  reduce  binding  and  catalysis  by  d-614. 

Competition  Studies  of  Ribose-614  Cleavage 

We  tested  the  ability  of  the  same  three  substrates  ("ang  +  ribose"  primer,  "ribose- 
4up6,"  and  "ribose-lib61o2")  to  compete  with  r-614  for  trans  cleavage.  The  competitors 
were  synthesized  with  a  deoxy-adenosine  in  the  place  of  the  ribose-adenosine,  and  added 
in  9  fold  excess  over  ribose-614.  As  a  control,  9-fold  excess  of  d-614  was  added  to  r- 
614.  The  addition  of  270  nM  d-614  to  30  nM  r-614  increased  cleavage  to  the  same  level 
as  r-614  at  300  nM  as  a  result  of  d-614  cleavage  of  r-614  in  trans  (Figure  3-27). 

The  cleavage  of  r-614  was  reduced  by  the  addition  of  "ang  -  ribose"  primer  and 
"d-lib61o2,"  showing  that  these  molecules  compete  for  r-614  trans  catalysis.  However, 
the  addition  of  9-fold  excess  614  mutant  "4up6"  to  r-614  did  not  show  the  reduction  in  r- 
614  cleavage  seen  by  other  competitors.     Instead,  the  r-614  cleavage  was  increased, 
although  the  increase  was  far  less  than  that  seen  with  the  addition  of  d-614.  Two 
scenarios  can  explain  this  result.  First,  it  is  possible  that  although  the  catalytic  activity  of 
"4up6"  is  reduced,  the  reduction  is  offset  by  the  great  excess  of  trans  acting  "4up6." 
Alternatively,  it  is  possible  that  the  mutation  in  "4up6"  does  not  alter  its  catalytic  motif 
and  activity,  but  instead  the  mutation  prevents  binding  of  the  catalytic  motif  near  its  own 
substrate.  Thus,  incubating  "4up6"  alone  shows  almost  no  catalytic  activity  because  the 
active  catalytic  motif  cannot  bind  near  its  own  substrate,  but  if  incubated  with  another 
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molecule  that  does  not  have  a  mutation  interfering  with  binding  of  the  catalytic  motif  of 
4up6  to  the  trans  substrate,  it  may  be  able  to  cleave  in  trans. 

The  competition  study  described  above  was  done  at  high  concentrations  of 
"enzyme"  (r-614)  to  favor  inter-molecular  association  and  trans  cleavage.  We  performed 
a  similar  competition  study,  but  with  concentrations  of  r-614  and  excess  competitor 
sufficiently  low  as  to  favor  uni-molecular  cleavage.  Under  these  conditions,  association 
of  the  competitor  to  the  enzyme  -  which  is  inter-molecular  -  is  less  favored  than  uni- 
molecular  association  of  the  enzyme  with  its  own  substrate  motif.  As  predicted,  the 
addition  of  3.0  nM  d-lib61o2  to  0.3  nM  r-614  had  no  noticeable  effect  on  r-614  cleavage 
(Figure  3-28). 

d-614  Cleaves  Faster  in  trans  Than  in  cis:  Single-turnover  Kinetics 

Substrate  saturation  experiments  were  performed  with  increasing  concentration  of 
"enzyme"  (d-614)  to  determine  the  ke*(bi)  for  the  bimolecular  reaction.  Figures  3-29,  30, 
and  3 1  show  the  full  progress  curves  for  cleavage,  initial  linear  portion,  and  enzyme- 
saturation  profile  for  experiments  with  various  substrates  ("ang  +  ribose",  r-lib61o2,  or  r- 
614)  cleaved  by  d-614.  The  calculated  Kj  d  is  39,  35,  and  18  nM  for  r-primer,  r-614  and 
r-lib61o2  substrates,  respectively.  The  kcat(bi),  which  is  determined  from  the  maximum 
rate  of  cleavage  with  saturating  enzyme,  is  0.040,  0.034,  and  0.025  hr"1  for  r-primer,  r- 
614  and  r-lib61o2  substrates,  respectively. 

While  the  Kd  for  each  substrate  was  relatively  low  (18-39  nM),  the  maximum  rate 
was  not  attained  until  enzyme  concentration  reached  approximately  2000  nM.  Indeed, 
the  initial  rates  appear  to  be  increasing  slightly  even  above  2000  nM,  indicating  that  the 
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saturation  may  not  be  completely  achieved  (sufficient  enzyme  was  not  available  to  test 
kinetics  at  concentrations  higher  than  6000  nM).  The  slow  approach  to  plateau  is  most 
notable  for  the  r-lib61o2  substrate,  and  to  a  lesser  extent  with  r-614,  suggesting  that 
substrate  or  enzyme  may  exist  in  inactive,  alternative  conformations  (an  explaining  the 
low  R2  value  for  the  association  curve  with  r-lib61o2). 

A  second  experiment  conducted  with  "ang  +  ribose"  primer  as  the  substrate  and 
excess  d-614  enzyme  concentration  increasing  from  1 1  nM  to  300  nM  also  demonstrated 
progressively  increasing  rates  up  to  0.048  hr"'  at  300  nM  (Figure  3-32).  Similarly, 
r-lib61o2  cleavage  increased  progressively  when  the  enzyme  (d-614)  was  in  excess  with 
concentrations  ranging  from  13  nM  to  450  nM.  Cleavage  rates  were  always  higher  for 
"ang  +  ribose"  primer  substrate  than  for  r-lib61o2  substrate  at  comparable  d-614 
concentrations,  again  suggesting  that  the  shorter  substrate  may  have  less  self-binding 
opportunities  which  compete  with  and  therefore  reduce  binding  and  catalysis  by  d-614. 

These  enzyme  excess  saturation  experiments  reveal  that  the  rate  for  trans 
cleavage  (k^bi)  =  0.034  hr"1,  for  r-614)  is  distinctly  higher  than  the  rate  for  c/s-cleavage 
(kcat(uni)  =  0.006  hr" ).  This  6-fold  increase  in  rate  is  surprising  given  the  expectation  that 
tethering  the  "substrate"  region  to  the  "enzyme"  region,  as  in  the  m-cleaving  614,  would 
significantly  increase  the  effective  concentration  of  the  substrate  relative  to  the  enzyme, 
and  thus  dramatically  increase  the  rate.  When  /raws-acting  enzyme  reaches 
concentrations  of  3.5  to  19  nM,  the  observed  rate  is  twice  the  rate  of  c/s-cleavage,  and 
thus  represents  the  concentration  at  which  the  c/s-cleavage  rate  and  fra«s-cleavage  rate 
are  equal  (Figure  3-21).  The  effective  molarity  of  linking  the  substrate  to  the  enzyme,  in 
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this  case,  is  therfore  approximately  3.5  -  19  nM;  above  this  concentration,  trans  cleavage 
increasingly  predominates. 

The  6-fold  increase  in  rate  for  trans  cleavage  is  also  surprising  given  the 
expectation  that  both  the  "catalytic"  and  the  "substrate"  region  of  614  is  unchanged 
whether  cleavage  is  in  cis  or  trans.  The  implication  is  that  the  linker  connecting  the 
"substrate"  and  "en2yme"  in  the  c/s-acting  614  somehow  interferes  (steric  hindrance) 
with  binding  in  a  way  not  present  in  the  /rans-acting  614. 

d-614  Cleaves  with  Multiple-turnover 

In  order  to  be  considered  a  true  enzyme,  the  catalyst  must  be  unchanged  at  the 
completion  of  its  activity  and  thus  be  able  to  complete  another  round  of  catalysis.  We 
have  already  demonstrated  that  c/s-acting  614  cleaves  itself,  and  that  neither  product 
retains  significant  catalytic  activity,  hence  it  behaves  as  an  auto-catalytic  molecule,  but 
not  classically  as  an  enzyme.  We  have  also  demonstrated  that  d-614  can  act  catalytically 
in  trans  and  is  not  altered  in  the  process.  We  not  proceed  to  test  whether  the  catalytic 
activity  of  d-614  can  function  with  multiple  turnover. 

Figure  3-33  and  34  show  the  result  of  incubating  either  "ang  +  ribose"  or  r-lib61o2 
substrate  in  four-fold  excess  over  d-614  enzyme  (either  at  133:33  or  400:100  nM 
substrate:enzyme).  Multiple  turnover  was  achieved,  but  rates  were  quite  slow.  At  the 
highest  concentration  of  enzyme  and  substrate,  2  turnovers  of  substrate  was  observed  at 
100  hours  for  "ang  +  ribose"  primer  substrate  and  200  hours  for  the  r-lib61o2  substrate. 
As  seen  under  single-turnover  conditions,  the  "ang  +  ribose"  substrate  cleaved  both 
faster,  and  at  lower  concentrations,  than  the  full-length  r-lib61o2  substrate  under  multiple- 
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turnover  conditions  (compare  the  dramatic  increase  in  multiple  turnover  cleavage  for  r- 
lib61o2  cleavage  when  d-614  concentration  is  increased  from  33  to  100  nM  with  the 
modest  increase  for  "ang  +  ribose"  primer  cleavage,  indicating  that  the  "ang  +  primer" 
cleavage  is  near  its  maximum  rate  of  turnover).  This  may  be  attributable  to  the  better 
ability  of  the  "ang  +  ribose"  substrate  to  bind  with  or  disassociation  from  the  enzyme,  or 
its  lower  proclivity  for  forming  nonproductive  interactions. 

Multiple  turnover  conditions  were  also  tested  by  holding  d-614  concentration 
constant  at  20  nM  and  varying  "ang  +  ribose"  concentration  from  100  to  2000  nM 
(Figure  3-35).  Approximately  two  turnovers  were  observed  in  the  first  100  hours,  while 
only  5-6  turnovers  were  observed  by  400  hours. 

If  the  rate  limiting  step  is  disassociation  of  the  substrate  from  the  enzyme,  we 
would  expect  to  see  a  "burst  phase"  in  the  initial  phase  of  multiple  turnover  kinetics  in 
which  all  of  the  initially  bound  substrate  is  cleaved,  followed  by  a  slower  phase  in  which 
the  rate  is  limited  by  the  product  release.  The  duration  of  the  burst  phase  corresponds  to 
the  fraction  of  the  substrate  that  is  initially  bound  to  the  enzyme.  If  the  enzyme  binds 
efficiently  to  the  substrate,  such  that  nearly  all  the  enzyme  has  bound  a  substrate  when 
the  substrate  is  in  excess,  then  this  burst  phase  corresponds  to  the  molar  fraction  of 
enzyme  to  substrate.  However,  if  the  enzyme  does  not  bind  substrate  efficiently,  or  the 
substrate  has  competition  for  binding  the  enzyme,  then  the  burst  phase  will  be  reduced. 
Figures  3-33,  34  and  35  show  that  the  reaction  remains  linear  well  beyond  the  initial 
turnover,  indicating  that  there  is  not  a  burst  phase  and  that  the  overall  rate  is  not  limited 
by  E»S  disassociation. 
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Testing  614  Rate  of  Association  and  Disassociation 

Values  obtained  above  for  cleavage  rates  of  614  are  comparable  to  rates  obtained 
for  magnesium  independent  DNAzymes,  but  apparently  slower  than  those  reported  for 
other  ribonuclease  DNAzymes  generated  by  IVS  (it  is  difficult  to  compare  results  directly 
since  other  DNAzymes  are  often  characterized  at  higher  temperatures  and  under  alkaline 
conditions,  both  of  which  dramatically  increase  cleavage  rates).  One  potential 
explanation  for  this  may  be  the  difference  in  the  double  strand  DNA  clamps  that  were 
engineered  to  fold  the  DNAzyme  onto  itself  and  thus  bring  the  presumably  catalytic  N40 
region  in  proximity  to  the  ribose  substrate  site.  The  lack  of  published  studies  on  the 
impact  of  these  engineered  clamps  presumably  reflects  an  assumption  by  the  IVS 
community  that  results  obtained  from  in  vitro  selections  should  be  independent  of  subtle 
changes  in  "starting  conditions,"  and  that  the  overall  outcome  is  indicative  of  the  inherent 
ability  of  nucleic  acids  to  perform  the  selected  behavior.  This  logic  suggests  that  the  two 
nucleotide  changes  made  to  the  "BJ  +  ribose"  primer  to  produce  the  "ang  +  ribose" 
primer  should  make  little  difference  to  the  overall  results.  Nonetheless,  these  two  "point 
mutations"  have  the  potential  to  dramatically  alter  the  ability  of  the  DNAzyme  to  fold 
onto  itself  in  the  way  engineered  by  Breaker  and  Joyce.  The  proclivity  of  614  for  acting 
in  trans  also  suggests  that  DNA  folding  may  be  crucial  our  understanding  of  614  activity. 

In  our  standard  protocol  for  initiating  kinetics,  enzyme  and  substrate  are  mixed 
together  in  reaction  buffer,  and  then  heated  to  97°C  for  3  minutes,  followed  by  a  slow 
cool  to  23  °C  over  10  minutes.  The  purpose  of  this  "slow  cool"  is  to  ensure  complete 
denaturation  of  substrate  and  enzyme  at  the  onset  of  the  experiment,  and  allow  specific 
control  of  the  folding  conditions.  Initially  it  was  expected  that  catalysis  was  dominated 
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by  cz's-cleavage,  but  our  studies  above  indicated  a  significant  amount  of  /raws-cleavage  at 
concentrations  higher  than  3.5  -  19  nM.  Whereas  c/s-cleavage  entails  only  intra- 
molecular folding,  /rans-cleavage  entails  two  separate  folding  interactions,  intra- 
molecular  folding  (whether  it  is  enzyme  or  substrate)  and  /«ter-molecular  folding 
(whether  it  is  between  enzyme  and  substrate,  or  potentially  between  enzyme  and 
inhibitor).  Although  phenomenalogically  different,  both  types  of  folding  occur 
simultaneously. 

We  tested  the  impact  of  our  "slow  cool"  folding  protocol  on  intra-molecular 
folding  by  comparing  the  cleavage  of  ribose-614  with  and  without  the  "slow  cool" 
folding  at  a  concentration  low  enough  as  to  favor  c/s-cleavage  (2  nM)  (Figure  3-36).  The 
overall  cleavage  of  r-614  was  higher  when  ribose-614  was  completely  denatured  and 
slowly  cooled,  as  opposed  to  simply  re-suspending  the  pellet  in  HEPES  buffer  and 
initiating  the  reaction  by  adding  NaCl  and  MgCb,  suggesting  that  the  "slow  cooling" 
procedure  enhances  intra-molecular  folding.  It  is  possible,  however,  that  this  rate 
enhancement  is  attributable  to  the  denaturation  induced  by  the  high  temperatures  in  the 
slow  cool  protocol,  rather  than  enhancement  of  the  intra-molecular  folding,  suggesting 
that  a  fraction  of  the  ribose-614  is  locked  into  an  inactive  conformation  as  a  result  of  the 
ethanol  precipitation.  If  this  is  the  case,  instead  of  improving  intra-molecular  folding,  we 
must  say  that  the  "slow  cool"  procedure  improves  "intra-molecular  re-folding." 

When  the  same  experiment  was  done  at  higher,  trans-favoring  concentrations,  the 
initial  rate  of  ribose-614  cleavage  is  unchanged,  but  the  plateau  is  slightly  higher  with  the 
"slow  cooling"  treatment  (Figure  3-37).  Similarly,  cleavage  of  1 .2  nM  r-614  by  150  nM 
d-614  was  identical,  regardless  of  whether  the  "enzyme"  and  "substrate"  were  mixed 
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prior  to  or  subsequent  to  the  "slow  cool"  folding  protocol  (data  not  shown).  This 
suggests  that,  although  the  "slow  cool"  folding  procedure  enhances  c/s-cleavage,  it  has 
little  effect  on  trans-cleavage  (which,  because  it  is  faster,  predominates  the  initial  rate). 

The  impact  of  the  "slow  cool"  folding  protocol  on  inter-molecular  folding  was 
also  tested  directly  by  mixing  the  enzyme  (d-614)  and  substrate  (either  r-lib61o2  or  "ang 
+  ribose"  primer)  either  before  or  immediately  after  the  "slow  cool"  procedure  (Figure 
3-33  and  34).  Multiple  turnover  experiments  showed  very  little  difference  in  cleavage, 
either  in  initial  rates  or  cleavage  plateau.  In  fact,  completely  omitting  the 
denaturing/slow  cool  protocol  had  no  effect  on  trans -cleavage  (Figure  3-38).  This  further 
demonstrates  that  inter-molecular  folding  is  not  significantly  effected  by  the  "slow  cool" 
protocol. 

To  test  whether  the  rate-limiting  step  for  ribose-614  /ra/w-cleavage  is  the  inter- 
molecular  association  or  the  chemical  step,  a  chase  experiment  was  performed  with  cold 
competitor.  Ribose-614  was  incubated  at  sufficiently  high  concentration  (200  nM)  as  to 
achieve  significant  trans-cleavage.  The  sample  was  allowed  to  fold  for  an  additional 
thirty  minutes  after  "slow  cooling,"  at  which  point  15-fold  excess  unlabelled  chase  ("ang 
+  ribose")  was  added.  A  significant  decrease  in  cleavage  was  observed  immediately 
following  the  addition  of  the  cold  chase  (as  compared  to  ribose-614  without  added  chase) 
(Figure  3-39).  Comparing  "200  nM  ribose-614  plus  chase"  with  "2  nM  ribose-614 
without  chase"  reveals  that  the  chase  has  reduced  cleavage  levels  below  even  that  seen 
for  a  nearly  all  cz's-cleavage  reaction  (Figure  3-40). 

If  the  rate  of  substrate  disassociation  from  enzyme  is  faster  than  the  cleavage  step 
(kcat(bi) «  k-i),  the  addition  of  a  cold  chase  should  completely  eliminate  subsequent 
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cleavage.  If,  however,  kcat(bi)>:>  k_i,  the  addition  of  the  chase  should  have  no  impact  on 
subsequent  cleavage  (assuming  sufficient  time  has  been  granted  before  the  addition  of  the 
chase  to  ensure  that  all  the  substrate  is  bound  by  the  enzyme).  The  addition  of  chase  to  r- 
614  resulted  in  an  intermediate  result:  cleavage  was  dramatically  reduced,  but  cleavage 
continued  to  increase  following  the  chase.  Several  scenarios  may  be  put  forward  to 
explain  this: 

(a)  The  rate  of  disassociation  of  substrate  from  enzyme  is  within  an  order  of 
magnitude  of  the  rate  of  cleavage  (kcat(bi)  ~  k-0- 

(b)  C/s-cleavage  may  not  be  completely  quenched  by  the  chase  (or  at  least  not  at  the 
concentration  of  chase  used).  The  residual  cleavage  following  the  addition  of  the 
chase  could  therefore  be  c/s-cleavage.  The  apparent  rate  of  cleavage  with  added 
chase  could  be  lower  than  the  rate  of  c/'s-cleavage  without  chase  if  ds-cleavage 
normally  occurs  through  a  process  of  folding,  unfolding,  and  refolding,  and  at 
some  point  during  the  folded  state,  it  self-cleaves.  Given  this  scenario,  adding 
chase  may  reduce  some  c/'s-cleavage  by  competing  with  614  for  c/s-folding.  This 
implies  that  k^uni)  ~  k-i(uni) 

(c)  Not  all  of  the  substrate  is  bound  by  the  enzyme  at  the  time  the  chase  is  added,  so 
even  though  kcat(bi)»k-i,  the  addition  of  the  chase  competes  for  association  of 
substrate  to  enzyme  and  subsequently  eliminates  cleavage  of  unbound  substrate. 
This  implies  that  the  rate  limiting  step  is  association  (kcaKbi^ki). 

(d)  Insufficient  chase  has  been  added.  If  kcat(bi)<<:k-i,  the  enzyme-substrate  complex 
will  disassociate  rapidly.  If  association  is  also  relatively  rapid,  the  enzyme  may 
circulate  through  a  number  of  chase  molecules  before  it  finds  labeled  substrate. 
The  reduction  in  cleavage  is  therefore  proportional  to  the  number  of  chase 
molecules  plus  substrate  molecules  relative  to  enzyme  molecules. 

The  chase  experiment  was  therefore  repeated  with  low  concentrations  of  r-614 
and  high  concentrations  of  d-614  (4nM  r-614,  2025  nM  d-614  and  10  uM  chase  added  at 
t  =  4  hours,  or  0.5  nM  r-614,  64  nM  d-614  and  1280  nM  chase  added  at  t  =  45  minutes). 
Similar  results  were  observed:  "ribose-614  plus  chase"  was  gradually  cleaved,  but 
always  at  levels  far  below  the  "minus  chase"  condition. 
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Previous  enzyme-saturation  experiments  indicate  that  2025  nM  d-614  is  sufficient 
to  approach  substrate  saturation,  therefore  arguing  against  scenario  (c)  as  a  likely 
explanation  for  the  fact  that  there  is  residual  cleavage  following  the  addition  of  the  chase. 
The  ratio  of  enzyme  (d-614)  to  chase  ("ang  +  ribose")  in  these  latter  two  experiments  was 
either  1 :5  or  1 :20.  Slightly  less  cleavage  was  observed  after  the  addition  of  chase  in  the 
1:20  enzymexhase  experiment,  suggesting  scenario  (d)  as  a  possible  explanation  for 
residual  cleavage  following  the  chase. 

Chase  experiments  conducted  with  ribose-614  cleaving  in  cis  (2  nM)  showed  no 
significant  change  in  cleavage  following  the  addition  of  30  nM  chase.  This  suggests  that 
either  intra-molecular  association  is  complete  prior  to  the  addition  of  the  chase  and  it 
does  not  unfold  prior  to  cleavage,  or  that  intra-molecular  folding  is  not  affected  by  the 
presence  of  competitors  at  30  nM.  Unlike  trans-acting  catalysts,  the  absolute 
concentration  -  not  just  the  proportional  dilution  -  can  alter  the  effectiveness  of  a  chase. 
Previous  experiment  have  demonstrated  that  low  concentration  of  competitors  (<3  nM) 
added  at  time  =  0  do  not  compete  with  r-614  cleavage,  while  higher  levels  of  competitor 
(270  nM)  reduce,  but  do  not  eliminate,  ribose-614  cleavage  (30  nM)  (Figure  3-27  and 
28).  Taken  together,  this  suggests  that  at  least  some  r-614  ds-cleavage  can  persist 
despite  increasing  concentrations  of  competitor,  supporting  the  plausibility  of  scenario 
(b). 

A  chase  experiment  was  therefore  performed  using  trace  amounts  of  "ang  + 
ribose"  primer  (4  nM  )  as  a  substrate  with  near-saturating  levels  of  d-614  (2000  nM  ) 
enzyme.  This  substrate  has  no  intrinsic  m-cleaving  potential,  eliminating  the  possibility 
that  cleavage  following  the  chase  is  caused  by  c/s-cleavage.  Furthermore,  the  unlabelled 
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"ang  +  ribose"  chase  was  added  in  either  2.5-  or  5-fold  excess  over  the  enzyme.  The 
results  indicate  that  cleavage  was  greatly  reduced,  but  continued  even  after  the  addition 
of  the  chase  (Figures  3-39  and  40).  The  reduction  was  slightly  greater  with  additional 
chase.  Together  these  experiments  indicate  that  the  rate  of  E»S  disassociation  is  greater 
than  the  rate  of  chemical  step  (k.|  »  kc«(bt))  and  the  residual  cleavage  is  a  function  of  the 
ratio  of  substrate  and  chase  to  enzyme  (scenario  (d)). 

The  chase  experiments  indicate  that  inter-molecular  association  (folding)  may  be 
the  rate-limiting  step  or  that,  once  folded,  the  substrate  disassociates  from  enzyme  prior 
to  cleavage.  This  was  further  explored  by  examining  the  temperature  dependence  of 
initial  cleavage  rates.  If  the  chemical  step  is  the  rate-limiting  step,  lowering  the 
temperature  should  lower  the  rate  of  cleavage,  whereas  raising  the  temperature  will 
increase  the  rate  (until  the  molecules  denatures).  However,  when  folding  is  rate  limiting, 
lowering  temperature  is  expected  to  stabilize  folding  and  thus  increase  cleavage  rate. 
Figures  3-41  and  42  shows  that  lowering  incubation  temperature  increases  the  initial 
r-614  cleavage  rate  in  both  cis  and  trans  conditions. 

As  the  chemical  step  is  expected  to  decrease  approximately  two-fold  for  every 
10°C  drop  in  temperature,  the  observation  that  a  rate  is  unchanged  by  a  20°C  change  in 
temperature  suggests  that  a  balance  is  reached  between  a  slower  chemical  step  (kcat(unj)  or 
kcat(bi))  and  a  more  stabilized  association  step  (ki(uni)  or  ki(bi>)  at  lower  temperatures. 

The  enhancement  of  the  folding/association  step  at  lower  temperatures  is  also 
seen  with  d-614  cleaving  r-lib61o2  in  trans  during  single-turnover  experiments  (Figure 
3-42B).  Multiple-turnover  experiments  reveal  a  rate  enhancement  at  lower  temperatures 
during  the  first  20%  of  cleavage,  following  which  the  rate  is  reduced  (Figure  3-38).  This 
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"burst  phase"  seen  under  multiple  turnover  conditions  at  lower  temperature  (while  not 
seen  at  higher  temperatures)  indicates  lower  temperatures  increase  the  initial  rate  by 
stabilizing  intermolecular  association  between  enzyme  and  substrate;  this  stabilization 
also  slows  the  disassociation  of  product  from  enzyme  and  therefore  reduces  the  rate  for 
multiple-turnover  subsequent  to  the  first  catalysis. 

Structural  Analysis  of  614 

The  ability  of  614  to  cleave  in  cis  is  not  particularly  surprising,  both  because  this 
is  the  function  for  which  it  was  selected,  and  because  many  ribozymes  function  to  cleave 
in  cis.  It  is,  however,  rather  interesting  that  614  also  possesses  the  ability  to  cleave 
various  substrates  in  trans,  and  furthermore,  that  the  rate  constant  for  trans  cleavage  is  in 
fact  6-fold  higher  than  the  rate  constant  for  c/s-cleavage.  The  ability  of  614  to  cleave  the 
"ang  +  ribose"  primer,  as  well  as  a  library  of  molecules  containing  the  "ang  +  ribose" 
primer,  suggests  that  614  has  a  "binding  motif  which  base-pairs  with  some  part  of  the 
"ang  +  ribose"  primer  common  to  all  substrates.  This  binding  positions  a  separate 
"catalytic  motif  near  the  ribo-adenosine. 

To  understand  how  this  may  work  in  both  cis  and  trans,  the  Stewart  and  Zuker 
mfold  program  was  used  to  generate  several  structure  predictions  for  614  in  cis  and  trans. 
Of  the  many  structures  proposed  by  the  mfold  program,  only  those  that  positioned  the 
ribo-adenosine  in  a  "constrained  but  accessible"  manner  were  selected  -  structures  were 
eliminated  if  the  ribo-adenosine  was  either  embedded  in  a  lengthy  helix  (inaccessible)  or 
if  it  was  positioned  in  the  middle  of  an  extended  loop  (unconstrained). 

Figure  3-43  shows  eight  of  lowest  free  energy  structure  predictions.  Despite  the 
diversity  of  candidate  structures,  several  common  folds  were  present  and  used  to  group 
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structures  into  Families  A,  B,  and  C.  The  four  structures  of  Family  A  each  possess  a 
75GATA78  stem  loop  (the  superscript  numbers  refer  to  the  position  in  the  overall 
sequence),  but  vary  in  the  fold  of  nucleotides  35  -70.  Both  structures  in  Family  B  share 
the  same  fold  between  nucleotides  50  and  77,  but  vary  in  the  fold  adopted  by  nucleotides 
30  through  50  and  77  through  84.  Structure  Al  and  Bl  share  the  same  fold  for 
nucleotides  30  through  47.  Both  structures  in  the  Family  C  share  the  same  folding 
pattern  for  nucleotides  35-75. 

Each  of  the  three  Families  A,  B  and  C  can  fold  in  such  a  way  as  to  form  base- 
pairing  between  the  17CGACTCACTAT27  (part  of  the  5'-"ang  +  ribose"  primer)  and 
85GTAGTGACG93  (part  of  the  3'  primer).  These  5'  and  3'  regions  can  also  fold  into 
separate  hairpins,  preventing  interaction  with  each  other,  as  seen  in  structure  C2.  The  3' 
hairpin  formed  between  nucleotides  78  and  101  disrupts  the  defining  elements  of  both 
Family  A  and  B  structures,  but  not  the  Family  C  structure  (as  seen  in  C2). 

We  have  demonstrated  that  614  can  cleave  "ang  +  ribose"  primer  (both  in  trans 
and  when  incorporated  as  nucleotides  1-49  of  614).  If  we  assume  that  the  structure 
formed  when  614  cleaves  in  cis  is  similar  to  the  structure  formed  when  614  cleaves  "ang 
+  ribose"  primer  in  trans,  then  this  suggests  that  the  c/s-folded  614  has  some  base-pairing 
between  its  first  49-nt  and  the  remaining  3'  end.  The  structure  shown  in  C2  allows  only 
minimal  interaction  between  the  "ang  +  ribose"  primer  region  and  the  remaining  3'  half 
(base  pairing  between  33CATGGC40  and  71GCCTG75).  However,  the  base-pairing 
between  nucleotides  l7CGACTCACTAT27  and  85GTAGTGACG93  seen  in  the  structures 
A 1-4,  Bl,  B2  and  CI  allows  significant  interaction  between  the  5'  and  3'  halves  of  614. 
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These  latter  structures  therefore  seem  more  probable  for  both  c/s-cleavage  and  trans- 
cleavage. 

This  was  tested  directly  by  mutating  nucleotides  T86  to  A,  G88  to  C,  and  G90  to 
C  in  614  (Figure  3-44).  These  introduced  mutations  greatly  diminished  the  ability  of  614 
to  cleave  itself,  as  well  as  the  substrate  r-lib61o2  in  trans  (compared  to  wild-type  614). 
Introducing  the  compensatory  changes  (C22G,  C24G,  A26T)  predicted  by  the  structures 
of  A 1-4,  Bl,  B2  and  CI  into  the  substrate  r-lib61o2  significantly  improved  trans  cleavage. 
Introducing  the  compensatory  changes  C22G,  C24G,  A26T  into  the  ribose-614 
containing  T86A,  G88C,  and  G90C  also  dramatically  improved  catalysis.  This 
demonstrates  that  the  helix  formed  between  17CGACTCACTAT27  and  85GTAGTGACG93 
shown  in  the  structures  A 1-4,  Bl,  B2  and  CI  is  probaby  formed  in  cis  and  trans. 

Attempts  were  made  to  eliminate  some  of  the  remaining  candidate  structures. 
Although  structure  CI  appears  quite  "linear"  and  does  not  appear  to  position  a  folded 
"catalytic  motif  near  the  ribo-adenosine,  it  does  possess  the  requisite  base-pairing 
between  the  17CGACTCACTAT27  and  85GTAGTGACG93  regions.  Because  the  5'  49- 
nucleotides  of  614  correspond  to  the  "ang  +  ribose"  substrate,  the  essential  elements  of 
structure  CI  could  be  formed  by  a  614/"ang  +  ribose"  enzyme/substrate  conjugate  if  the 
"ang  +  ribose"  primer  displaces  the  5'  half  of  614.  If  structure  CI  were  indeed  the 
correct  structure,  the  3'  half  of  614  (nucleotides  56  through  94)  should  be  sufficient  to 
bind  and  cleave  the  "ang  +  ribose"  substrate.  This  was  tested  by  generating  the  3'  half  of 
614  (both  "614  56-94nt"  and  "614  56-106nt").  Truncating  614  at  position  94  eliminates 
the  possibility  of  forming  structure  C2,  while  still  permitting  structure  CI .  When  mixed 
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with  "ang  +  ribose"  primer,  neither  "614  56-94nt"  nor  "614  56-106nt"  catalyzed  cleavage 
of  "ang  +  ribose",  suggesting  that  structure  CI  is  not  correct. 

Screening  of  mutants  of  614  revealed  that  any  single  mutation  of  position  55,  65 
or  72  greatly  reduced  catalytic  activity.  Either  of  the  G55A  or  G65A  mutations  have  the 
potential  to  disrupt  the  structure  of  A4.  Each  of  these  mutants  was  made  with  the 
compensatory  mutations  predicted  by  structure  A4  (614:G55A/C47T,  and 
614:G65A/C40T).  Neither  compensatory  mutation  improved  catalysis  over  the  single 
mutant  (Figure  3-45),  arguing  against  the  validity  of  structure  A4. 

The  C72T  mutant  also  had  reduced  catalytic  activity;  this  mutation  is  predicted  to 
disrupt  a  base-pair  with  G81  in  all  of  the  Family  A  structures.  The  validity  of  these 
structures  was  tested  by  making  the  compensatory  G81 A  mutation.  The  double-mutant, 
however,  cleaved  even  slower  than  the  single  mutant,  suggesting  against  the  validity  of 
any  of  the  Family  A  structures  (Figure  3-45).  It  is,  however,  possible  this  stem-loop 
predicted  in  Family  A  is  in  fact  part  of  the  correct  structure,  but  the  GC  base-pair  cannot 
be  replaced  with  an  AT  base-pair  (either  because  the  strength  of  the  AT  base-pair  is 
lower,  or  because  the  original  G  or  C  played  a  catalytic  function  in  addition  to  its 
structural  role  in  the  stem  loop). 

Position  81  of  wild-type  614  was  therefore  converted  from  a  G  to  a  C  (named 
"614:G81C").  The  G81C  mutation  has  the  potential  to  disrupt  the  structures  of  both 
Family  A  and  the  structure  B2,  although  in  different  manners.  As  mentioned  above,  G8 1 
is  predicted  to  base-pair  with  C72  in  Family  A  structures;  G8 1  is  predicted  to  base  pair 
with  C43  in  structure  B2.  The  614:G81C  mutant  demonstrated  greatly  reduced  self- 
cleavage,  allowing  us  to  test  both  Family  A  and  B2  structures  by  making  the  predicted 
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compensatory  mutations  ("614  G81C,  C72G"  and  "614  G81C,  C43G").  Neither  of  the 
double-mutations  restored  catalytic  activity,  suggesting  that  neither  structure  B2  nor  the 
structures  of  Family  A  are  in  fact  the  actual  structure.  It  remains  possible,  however,  that 
one  of  the  structures  of  Family  A  or  structure  B2  is  in  fact  correct,  but  G81  is  crucial  to 
catalysis  for  reasons  beyond  its  ability  to  base-pair  with  either  C72  or  C43. 

The  evidence  above  argues  against  the  validity  of  Family  A  and  C  structures,  as 
well  as  structure  B2,  leaving  only  one  candidate  structure  remaining  (Bl).  If  structure  Bl 
represents  the  correct  fold  for  614  acting  in  cis,  and  the  essential  structural  elements  are 
preserved  when  614  cleaves  in  trans,  then  nucleotides  41  through  91  should  be  sufficient 
for  cleaving  the  "ang  +  ribose"  substrate.  Because  this  truncated  614  ("614  trunc41-91") 
can  conceivably  adopt  the  other  structures  of  Family  A  and  C,  the  "ang  +  ribose" 
substrate  was  altered  to  improve  binding  if  it  adopts  the  Bl  structure  but  not  by  other 
structures  (this  "Bl  substrate"  was  produced  by  truncating  "ang  +  ribose"  to  include  only 
nucleotides  21-35,  with  a  G  inserted  between  nucleotides  31  and  32,  and  A34  changed  to 
T,  resulting  in  5'TCACTATrAGGAGAGTC").  The  catalytic  ability  of  "614  trunc41-91" 
was  then  tested  with  both  "ang  +  ribose"  and  "Bl  substrate."  Neither  substrate  was 
cleaved  by  "614  trunc41-91,"  arguing  against  our  final  proposed  structures,  Al  or  Bl. 

Structure  predictions  were  made  for  Family  A,  B  and  C  with  614  acting  in  trans 
(Figure  3-46).  The  ability  of  614  to  cleave  the  "ang  +  ribose"  substrate  demonstrates  that 
nucleotides  50-106  are  not  a  required  element  of  the  substrate  for  614  /raws-cleavage. 
The  structure  predictions  shown  for  614  cleaving  in  trans  are  those  using  "ang  +  ribose" 
substrate,  but  are  essentially  the  same  for  full-length  substrates. 
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Figure  3-46  shows  that  a  four  base-pair  helix  can  be  formed  between  nucleotides 
35CATG38  of  614  and  the  identical  nucleotides  of  the  "ang  +  ribose"  substrate.  This  helix 
is  not  formed  when  614  cleaves  in  cis  because  the  sequence  CATG  sequence  occurs  only 
once.  This  short  helix  presumably  offers  additional  stabilization  between  the  "ang  + 
ribose"  substrate  and  the  614  enzyme  without  disrupting  potentially  catalytic  helixes  of 
structures  Al  and  C  (although  it  does  disrupt  part  of  structures  A2  and  A3). 

Interestingly,  a  mutational  screening  of  614  isolated  a  G38A  mutant  (called 
"june#2  G38A")  that  had  reduced  activity  (compared  to  wild-type  614)  when  tested  at 
100  nM  (a  concentration  at  which  significant  trans  cleavage  occurs). 

The  significance  of  the  predicted  35CATG38  ^raws-helix  was  tested  by  introducing 
the  compensatory  mutation  into  the  non-catalytic  substrate  r-lib61o2  (the  compensatory 
mutant  is  called  "Hb61o2  C35T").  Previous  /ra«s-cleavage  assays  demonstrated  that 
wild-type  d-614  can  cleave  ribose-lib61o2.  The  614  mutant  "june#2  G38A"  was  then 
tested  for  its  ability  to  cleave  either  "r-lib61o2"  or  "r-lib61o2  C35T."  Figure  3-47  shows 
that  the  introduction  of  the  C35T  mutation  into  the  Hb61o2  substrate  improves  cleavage 
by  the  614  mutant  "june#2  G38A,"  suggesting  that  the  35CATG38  helix  is  indeed 
important  for  fra/zs-cleavage.  The  reduction  in  cleavage  seen  when  "june#2  G38A" 
cleaves  "Hb61o2  C35T"  as  compared  to  wild-type  614  cleaving  wild-type  r-lib61o2  is 
likely  the  result  of  the  weaker  AT  base-pair  as  opposed  to  the  GC  base-pair. 

Structure  predictions  were  made  with  614  bound  to  the  "ang  +  ribose"  substrate 
via  the  two  experimentally  supported  interactions:  base-pairing  between  35CATG38  of  the 
614  and  the    CATG38  of  the  substrate,  and  base-pairing  between  the 
17CGACTCACTAT27  region  of  the  substrate  and  the  85GTAGTGACG93  region  of  the 
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enzyme.  These  ^raws-interactions  interactions  eliminate  structures  formed  for  cis-folded 
614  in  the  first  38  nucleotides.  Most  of  the  structural  elements  predicted  for  c/s-folded 
614  are  preserved  in  the  trans-folded  614/substrate  complex  (Figure  3-46).  The  free- 
energy  predictions  for  the  /raws-folded  614/substrate  complexes  were  all  much  lower 
than  the  c/s-folded  614  structures  (cis-614  dG  =  approximately  -12  kcal/mol,  trans 
614:substrate  dG  =  -20  kcal/mol).  This  agrees  well  with  previous  experimental  results 
indicating  that  overall  cleavage  rate  is  affected  by  folding  and  occurs  faster  in  trans. 

Structural  probing  was  also  performed  using  chemical  modifications  specific  for 
single-stranded  DNA.  The  trans-folded  conformation  was  favored  by  incubating  high 
concentration  of  5'-labeled  d-614  (300  nM)  in  reaction  buffer  with  and  without  excess 
unlabeled  primer  (10  uM).  The  samples  were  allowed  to  incubate  overnight  to  maximize 
folding,  after  which  they  were  treated  with  either  potassium  permanganate  (specifically 
modifies  T)  or  DMS  (specifically  modifies  G  at  high  salt  concentrations)  for  two  minutes 
and  five  minutes.  Recovered  samples  were  run  on  a  PAGE-urea  gel  next  to  a  10-bp 
ladder.  All  T  and  G  nucleotides  demonstrated  some  sensitivity  to  the  respective  chemical 
modifications,  indicating  that  either  some  614  is  not  folded  at  all,  or  adopts  multiple 
conformations.  Nonetheless,  contrast  levels  were  adjusted  to  discern  the  J/iosf-protected 
nucleotides  (suggesting  they  are  base-paired).  Figure  3-48  shows  the  result  of  this 
analysis,  indicating  that  T44,  T46,  T74,  G33,  G41,  G65,  G71,  and  G75  all  show  some 
degree  of  differential  protection.  The  pattern  of  protected  and  sensitive  positions 
observed  under  chemical  modifications  is  not  simultaneously  compatible  with  any  single 
structure  predicted  in  Figure  3-43  for  m-folded  614  or  Figure  3-46  for  /ra«s-folded  614, 
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suggesting  that  either  a  mixture  of  multiple  trans- folded  conformers  exists,  or  none  of  the 
predicted  structures  accurately  reflect  the  true  structure  of  614. 

Structural  investigations  of  614  are  continuing.  It  is  interesting  to  note  that  a  vast 
majority  of  reported  structures  for  in  vitro  selected  DNAzymes  are  based  on  unverified 
predictions  using  the  mfold  program.  The  mfold  program  has  been  shown  to  be  useful  for 
providing  plausible  structures,  but  our  experience  with  614  demonstrates  that  unverified 
predictions  of  DNAzymes  should  be  taken  very  lightly. 

Summary  of  Results 
Table  3-1  (on  next  page)  summarizes  the  major  results  of  experiments  described 
in  this  chapter. 
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Table  3-1.  Summary  of  Results  from  Chapter  3 


Reasons  for  cleavage  plateau  of  614 

•  Missing  ribose-A  in  -5%  of  primer  (higher  for  caged  ribose  due  to  synthesis). 

•  Alternative  inactive  conformation  -15%. 

•  Mutations  that  lower  catalytic  activity  in  both  primer  and  N40  regions. 

•  Not  insufficient  laser. 

•  Not  complementary  strand  inhibition. 

Kinetic  analysis 

•  614  cleaves  in  cis  (unimolecular)  at  low  concentrations  (<  20  nM). 

•  614  cleaves  in  trans  (biomolecular)  at  high  concentrations. 

•  614  cleaves  various  substrates  in  trans. 
•kcat(bi)- 0.035  hr"1 

•kobs(uni)=  0.006  hr"1 

•  The  increase  in  the  observed  rate  of  cleavage  at  lower  temperatures  indicates 

that  the  rate  is  a  balance  between  the  chemical  step  and  the  association  step. 

•  The  rate  of  E»S  disassociation  (k_i(bi))  for  fra«s-cleavage  is  greater  than  the 

chemical  step  (kcat(bi)) 

Structural  Analysis 

•  Many  structures  for  cz's-folded  614  are  possible,  but  none  have  yet  to  be  proven. 

•  There  is  an  interaction  between  17CGACTCACTAT27  and  85GTAGTGACG93 


that  occurs  in  both  cis  and  trans-folded  614. 
here  is  an  interaction  between  35CA1 
that  cannot  occur  in  cw-folded  614. 


There  is  an  interaction  between  35CATG38  of  the  Enzyme  (614)  and  Substrate 
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Lamda   exonuclease  --      +      +      --      +      + 

strong  base  -+-+-+-+ 

ribose    or   caged-r  RRRRCCCC 

LANE    #  12345678 


106-nt    full-length    614    > 


♦•*     #••- 


79-nt  cleaved  product  >  m   fe  jfe 


Figure  3-1.  Lamda  exonuclease  degrades  the  5'-phosphorylated  complementary 
stand  of  ribose-614  and  caged-614.    Ribose-614  and  caged-614  were  generated  via 
PCR;  both  the  catalytic  and  complementary  strand  were  labeled  with  alpha-CTP. 
Treatment  of  double  stranded  ribose-6 1 4  with  strong  base  converts  the  catalytic  strand 
to  the  cleaved  product,  while  treatment  with  exonuclease  degrades  the  complementary 
strand  to  single-nucleotides  (not  shown  on  gel).  PCR  with  the  caged-ribose  primer 
generates  a  complementary  strand  that  is  that  is  the  same  length  as  the  cleaved 
catalytic  strand,  corresponding  to  the  where  the  polymerase  terminates  elongation 
when  encountering  the  caged-ribose  in  the  template. 
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106  nucleotide  template  extended  with: 
A  5'-end  labelled  complimentary  strand  primer:      "short"  +  linker  +  tail 
*  5'-end  labelled  catalytic  strand  primer:  "ang  +  ribose" 


Figure  3-2.  Asymmetric  PCR  using  various  linkers  increases  the  size  of  the 
complementary  strand  without  increasing  the  size  of  the  catalytic  strand.    PCR 

reactions  where  done  in  pairs  with  the  catalytic  strand  primer  end  labeled  in  the  odd- 
numbered  lanes,  while  the  complementary  strand  primer  was  end-labeled  in  the  even- 
numbered  lanes. 


+ 
+ 


lane  #    1 


+ 
+ 


tail  on  complimentary  strand  primer 
caged  ribose  on  catalytic  strand  primer 
exonuclease 
NaOH 

121  nt:  full  length  +  15  nt  tail 

106  nt:  full  length  product 

92  nt:     full  length  product 

+  15nttail-29nt5'ofcage 

79  nt:     cleaved  product 


Figure  3-3.  Size  difference  between  catalytic  and  complementary  strands 
generated  by  asymmetric  PCR  allow  for  gel-purification  of  the  catalytic  strand. 

The  presence  of  the  cage-ribose  in  the  template  causes  truncation  of  the 
complementary  strand.  Asymmetric  PCR  therefore  generates  a  complementary  strand 
that  is  either  longer  or  shorter  than  the  catalytic  strand. 
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Figure  3-4.  Maximum  deprotection  of  caged-ribose  primers  occurs  with  100 
pulses  of  laser.    End-labeled  "ang  +  cage"  primer  was  exposed  to  increasing  pulses  of 
laser  (308  nM,  50  mJ/pulse)  and  the  amount  of  deprotection  determined  by  subsequent 
cleavage  in  0.5  M  NaOH  for  60  minutes  at  80°C.  Cleavage  reaches  a  plateau  of 
approximately  80%  with  100  pulses  of  laser.  "Ang  +  cage"  primer  exposed  to  1360 
pulses  of  laser,  but  not  treated  with  strong  base,  showed  approximately  1%  cleavage. 
Unprotected  "ang  +  ribose"  primer  was  90-97%  cleaved  under  strong  base  conditions. 
Exonuclease  purified  full-length  ribose-614  was  85-95%  cleaved  under  strong  base 
conditions.  *  indicates  incubation  with  0.25M  NaOH  rather  than  0.5M. 
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a    r-614,  no  laser 

■    c-614  400nM ,  +  laser 


hours 


Figure  3-5.  Caged-614  shows  similar  kinetics  to  ribose-614  when  deprotected  by 
150  pulses  of  laser.  For  all  kinetic  analysis,  error  is  estimated  to  be  +/-  0.05  hour  and 
less  than  +/-  5  percent  cleaved.  Identical  experiments  conducted  in  parallel  fell  within 
these  error  estimates  (see  Figure  3-30B  as  an  example). 
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70-, 


r-614  +  0  laser 
C-614+  1000  laser 

C-614  +  300  laser 
C-614+  150  laser 
C-614  +  50  laser 
C-614+  10  laser 
c-614  +  0  laser 


175 


hours 


ribose614  +  laser 

c-614  +  1000  laser 

c-614  +  300  laser 
c-614  +  150  laser 
c-614  +  50  laser 
c-614  +  10  laser 
c-614  +  0  laser 


hours 


Figure  3-6.  Progress  curves  of  caged-614  cleavage  demonstrate  maximum 
deprotection  with  150  pulses  of  laser.    (A)  Caged-614  with  less  than  150  pulses  of 
laser  had  greatly  reduced  cleavage  plateau,  indicating  that  the  sample  was  not 
completely  deprotected.  (B)  When  cleavage  was  normalized  by  adjusting  the  percent 
cleaved  out  of  only  the  fraction  of  sample  that  was  deprotected  (determined  by  NaOH 
treatment,  the  kinetic  profiles  overlap  for  all  samples  (except  0  laser). 
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Figure  3-7.  The  cleavage  plateau  of  caged-614  reaches  a  maximum  with  150 
pulses  of  laser.  Samples  of  caged-614  were  initiated  with  increasing  amounts  of  laser 
and  allowed  to  self-cleave  until  the  reaction  reached  plateau.  Samples  initiated  with 
150  pulses  of  laser  and  more  reached  equivalent  cleavage  plateaus  of -50%  (about 
-10%  lower  than  for  an  equivalent  experiment  with  ribose-614. 
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Figure  3-8.  Maximum  deprotection  of  caged-614  occurs  with  150  pulses  of  laser. 

Samples  of  caged-614  were  exposed  to  increasing  amounts  of  laser  and  then  incubated 
with  strong  base  for  1  hour  at  80°C  to  determine  the  amount  of  deprotection.    Caged- 
614  exposed  to  1 50  pulses  of  laser  is  75%  cleaved  by  the  strong  base.  Additional  laser 
does  not  increase  the  amount  cleaved  by  strong  base.  Ribose-614  has  slightly  higher 
cleavage  under  similar  conditions. 
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10.0       12.5       15.0 


r614  +  300  laser 
r614+  150  laser 
r614  +  50  laser 
r614+  10  laser 
r614  +  0  laser 


■    r-614  +  300  laser 
r-614  +  150  laser 
r-614  +  50  laser 
r-614  +  10  laser 
r-614  +  0  laser 


hours 


Figure  3-9.  Ribose-614  is  not  damaged  by  laser.  (A)  Full  time  course,  and  (B) 
linear  initial  phase,  of  ribose-614  self  cleavage  following  increasing  amounts  of  laser 
pulses.  The  cleavage  profile  of  ribose-614  is  unchanged  by  exposure  to  increasing 
number  of  laser  pulses. 
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300  nM  r614,  ImmMgCB 
100nMr614,  ImmMgCE 
33nMr614.  ImmMgCB 
11.1  nMr614,  1mmMgCI2 

300  nM  r614,  3mm  MgCC 
100nMr614,  3mm  MgCt2 
33nMr614,  3mm  MgCC 
11.1  nMr614,  3mm  MgCC 


300 


Figure  3-10.  Ribose-614  cleavage  is  not  altered  by  3-fold  variation  in  MgCh 
concentration.    Various  concentrations  of  r-614  were  incubated  in  50  mM  HEPES 
pH  7,  1  M  NaCl,  and  either  1  mM  or  3  mM  MgCk  Increasing  the  MgCb 
concentration  three-fold  had  no  significant  effect  on  r-614  cleavage  rate  at  any 
concentration. 


HEPES  -NaCl  -MgCI 
HEPES  -NaCl  +MgCI 
HEPES  +NaCI  +MgCI 
HEPES  +NaCI  +MgCI 
+cacodylate 


100  150 

hours 


200 


250 


Figure  3-11.  Ribose-614  is  inactive  without  NaCl.    Ribose-614  was  incubated 
with  50  mM  HEPES  pH  7,  plus  either  1M  NaCl,  1  mM  MgC12,  or  50  mM 
cacodylate. 
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r«14  115nM 
♦compl  1 1  5  nM 
r614  11  5nM 
•compl  1  15  nM 
r614  222  2nM 
•614  74  1  nM 
r«14  24  7nM 
r«14  8  2nM 


•  r61450nM 
♦compl  50  nM 

.    i€14  200nM 
♦compl  200  nM 

r€14  222  2nM 

•  n514  74  1nM 

•  nS14  24  7nM 
-    r614  8  2nM 


Figure  3-12.  Ribose-614  cleavage  is  unaffected  by  the  addition  of  10% 
complementary  strand,  but  addition  of  equimolar  amounts  greatly  lowers 
cleavage  rate  and  plateau.    (A)  Full  time  course,  and  (B)  initial  time  course,  for 
ribose-614  with  10%  added  complementary  strand.  (C)  Full  time  course,  and  (D) 
initial  time  course,  for  ribose-614  with  equimolar  added  complementary  strand. 
Ribose-614  was  incubated  with  exogenously  added  complementary  strand  at  either 
10%  of  the  total  amount  of  ribose  614,  or  at  equimolar  amount.  Each  reaction  was 
done  at  high  (115  nM)  and  low  (1 1.5)  nM  ribose-614  concentration  and  compared 
against  pure  ribose-614  at  a  range  of  concentrations. 
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c-614(200nM) 

c-614(200nM) 
+  unlabeled  d-614 
(400  nM)  at  143.5  hrs 

c-614  (200  nM)  diluted 
to8nMat  143.5  hours 


Figure  3-13.  614  cleavage  plateau  is  not  a  result  of  an  approach  to  chemical 
equilibrium.    614  was  allowed  to  react  until  it  approach  cleavage  plateau,  at  which 
point  it  was  either  diluted  25-fold,  or  incubated  with  2-fold  additional  unlabeled  d- 
614.  Neither  condition  altered  the  cleavage  plateau  of  614  in  a  time-dependent 
manner. 


r-614 

r-614  +  27-nt  product 
r-614  +  79-nt  product 
r-614  +  27-  &  79-nt  product 
r-library  +  27-nt  product 
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Figure  3-14.  Addition  of  the  27-nucleotide  and  the  79-nucleotide  cleavage 
products  alone  or  together  do  not  alter  the  cleavage  profile  of  614  (400  nM  each). 

Addition  of  the  79-nucleotide  cleavage  product  to  the  r-library  substrate  resulted 
increased  cleavage,  but  at  an  extremely  slow  rate.  This  cleavage  of  the  r-library 
substrate  in  the  presence  of  the  79-nt  fragment  is  lowered  by  the  addition  of  the  27-nt 
cleavage  product. 
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♦    r-614  (222.2  nM) 

«    r-614  (100  nM): 
uncleaved  r-614 
gel  purified 
at  140  hours 


hours 

Figure  3-15.  Gel-purification  of  614  at  cleavage  plateau  results  in  additional 
cleaveage,  indicating  a  fraction  of  614  is  folded  into  an  inactive  conformation. 

Ribose-614  was  allowed  to  self-cleave  for  140  hours,  nearing  cleavage  plateau,  at 
which  point  half  the  sample  was  gel-purified  on  a  denaturing  gel.  After  resuspending 
the  gel-purified  ribose-614  ("gp614")  in  reaction  buffer  to  100  nM,  additional 
cleavage  was  detected. 


r-614  (lOOnM) 

614  (100  nM):  heat 
denatured  at  141  hours 


200 


300 


400 


500 


hours 


Figure  3-16.  Reheating  ribose-614  results  in  additional  cleavage,  indicating  a 
fraction  of  614  is  folded  into  an  inactive  conformation.    Ribose-614  was  allowed  to 
react  until  it  reached  cleavage  plateau,  at  which  point  half  of  the  sample  was  denatured 
by  heating  to  97°C  for  3  minutes,  and  slowly  cooled  to  23 °C  over  10  minutes.  This  re- 
folding procedure  resulted  in  an  additional  10%  cleavage. 
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Figure  3-18.  Mutants  of  614  have  reduced  catalytic  activity.    (A)  Complete  time 
course;  (B)  Linear  initial  phase  of  time  course.  Mutations  of  614  were  common  in  the 
uncleaved  fraction  of  614  following  cleavage  plateau.  Kinetic  analysis  was  done  of 
many  of  these  clones  (all  at  -100  nM).  Most  mutations  greatly  reduced  catalytic 
activity  when  compared  to  wildtype  614. 
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Figure  3-19.  Mutants  of  614  have  reduced  activity  when  cleaving  r-lib61o2  in 
trans.    (A)  Complete  time  course;  (B)  Linear  initial  phase  of  time  course.  The  ability 
of  several  614  mutants  to  cleave  in  trans  was  tested  by  incubating  614  variants  (200 
nM,  unless  noted  differently)  with  r-lib61o2substrate  (7.5  nM,  unless  noted 
differently).  Most  mutations  greatly  reduced  catalytic  activity  when  compared  to 
wildtype  614.  Nucleotides  6-1 1  were  changed  to  AGTACT  to  create"d-6 1 4mut  A 
box." 
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Figure  3-20.  614  cleavage  rate  is  concentration  dependent.    (A)  Complete  time 
course  for  r-614  at  various  concentrations.  Rates  are  estimated  based  on  an 

exponential  fit  to  a  single  first-order  exponential  plateau  below  100%.  (B)  Linear 
initial  phase  of  time  course.  Rates  are  estimated  based  on  slope  of  best  fit  line. 
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Figure  3-21.  The  rate  of  r-614  cleavage  increases  at  higher  concentrations,  but 
reaches  a  lower  limit  at  low  dilutions.    Lowering  r-6 1 4  concentration  initially 
reduces  cleavage  rate,  but  the  cleavage  rate  reaches  a  lower  limit  of  approximately 
0.005/hour  at  concentrations  of  20  nM  and  below. 
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Figure  3-22.  Extrapolating  r-614  cleavage  rates  at  low  concentrations  to  infinite 
dilution  estimates  a  unimolecular  self-cleavage  rate  of  0.0059/hr.    (A)  The  initial 
rates  of  cleavage  for  614  at  various  concentrations  are  plotted  vs.  initial  r-614 
concentration  using  a  linear  scale.  (B)  A  best  fit  line  of  the  initial  rates  for  614 
cleavage  at  low  concentration  (<=  3.5  nM)  is  extrapolated  to  zero,  yielding  a  cis- 
cleavaee  rate  of  0.0059/hr. 
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Figure  3-23.  Cleavage  rates  for  low  concentrations  of  r-614  (but  not  high)  fit  well 
to  a  linear  equation  when  the  natural  log  of  substrate  concentration  is  plotted  vs 
time,  indicating  first  order  exponential  rate. 
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time  (hours)  time  (hours) 
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u        <  d-6 14  (full  length)  r-61  4  > 


cleaved  r-614  > 


<       full-length  r-primer       > 


•••••••• 


<         cleaved  r-primer  > 

*      •     •    ♦    ♦ 


Figure  3-24.  Both  r-614  and  d-614  cleave  r-primer  ("ang  +  ribose")  in  trans.    614 

was  internally  labeled  and  r-primer  was  5 '-end  labeled. 
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Figure  3-25.  Ribose-primer  competes  with  r-614  for  cleavage  by  r-614.    The 

cleavage  of  r-614  is  lowered  when  incubated  with  r-primer. 


■    r-primer  (7.5  nM) 
+  d-614(270nM) 

*  r-lib6lo2  (7.5  nM) 
+  d-614(270nM) 

»    r-library  (7.5)  nM 
+  d-614(270nM) 

♦  r-4up6  (7.5)  nM 
+  d-614(270nM) 


100 


hour 


Figure  3-26.  d-614  can  cleave  various  substrates  in  trans.    Unlabeled  d-614  (270 
nM)  was  incubated  with  various  radio-labeled  substrates  (7.5  nM),  resulting  in 
substrate  cleavage.  Each  substrate  contained  the  same  47-nucleotide  r-primer  region 
with  a  ribose-adenosine  at  position  27. 
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Figure  3-27.  Various  substrates  can  compete  with  r-614  for  cleavage  by  d-614. 

Addition  of  a  9-fold  excess  of  unlabeled  d-lib61o2  or  d-primer  to  radio-labeled  r-614 
(33  nM)  at  time  zero  lowered  cleavage  of  r-614.  Addition  of  the  same  amount  of 
unlabelled  d-614  to  labeled  r-614  increased  r-614  cleavage  to  levels  similar  to  that 
seen  with  300  nM  r-614.  Addition  of  d-4up6  increased  r-614  cleavage,  but  not  as 
much  as  the  addition  of  d-614. 


•  0.3  nM  r-614  +  3  nM  d-614 

♦  0.3r-614  +  3nMd-lib6b2 

*  0.33  nM  r-614 
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Figure  3-28.  The  cis-cleavage  rate  of  r-614  is  not  affected  by  excess  competitors. 

Radio-labeled  r-614  was  incubated  at  a  low  concentration  (0.33  nM)  so  as  to  minimize 
trans-cleavage.  Addition  of  9-fold  excess  d-lib61o2  competitor  did  not  affect  the 
cleavage  rate  of  r-614. 


104 


00- 

00- 

80- 

■ 

a 
• 

• 
■ 

• 

• 

70- 

A 

■ 

60- 

9 

■ 

50 

■ 

40- 

■ 

30- 

i. 

» 

• 

• 

B 


C 

0.050 

0.045 

0.040 

d     0.035 

3 

£     0.030 
■S     0.025 

2 

m    0020 

■ 

£     0.015H 

0.010 

0.005 

0.000 


100 

hours 


r-phmer  (4  nM)  cleavage  by: 

-    25nMd-614 
a    75nMd-614 
225  nM  d-614 

•  675nM*«14 

•  2025  nM  d-614 
p    6075  nM  d-614 


Substrate  = 

r-primer 

BMAX 

0.04076 

KD 

38.50 

R  squared 

0.8996 

2000  3000  4000 

[E  =  d614]  nM 


5000 


6000 


7000 


Figure  3-29.  Single-turnover  kinetics  for  d-614  cleaving  r-primer.    (A)  Full  time 
course.  (B)  Initial  (linear)  phase  of  cleavage.  (C)  Substrate  saturation  curve. 
Substrate  saturation  experiments  were  conducted  by  incubating  r-primer  (4  nM)  with 
increasing  amounts  of  d-614  enzyme. 
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Figure  3-30.  Single-turnover  kinetics  for  d-614  cleaving  r-614.    (A)  Full  time 
course.  (B)  Initial  (linear)  phase  of  cleavage.  (C)  Substrate  saturation  curve. 
Substrate  saturation  experiments  were  conducted  by  incubating  r-614  (4  nM)  with 
increasing  amounts  of  d-614  enzyme. 
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Figure  3-31.  Single-turnover  kinetics  for  d-614  cleaving  r-lib6lo2.    (A)  Fulltime 
course.  (B)  Initial  (linear)  phase  of  cleavage.  (C)  Substrate  saturation  curve. 
Substrate  saturation  experiments  were  conducted  by  incubating  r-lib61o2  (4  nM)  with 
increasing  amounts  of  d-614  enzyme. 


107 


B 


100-i 


hours 


Kexpfit  = 
0.04213 

003780 

004260 
0.02690 


Kexpfl 


0021 


0012 


00078 


0.0040 


30  nM  r-primer, 
300  nM  d-614 

10  nM  r-primer, 
100  nM  d-614 

3.3  nM  r-primer, 
33  nM  d-614 

1.1  nM  r-primer, 

11  nM  d-614 


22.5nMr-lib6lo2 
cleaved  by 
450  nM  d-614 

7.5  nM  r-lib6lo2 
cleaved  by 
150  nM  d-614 

2.5  nM  r-lib6to2 
cleaved  by 
50  nM  d-614 

0.8  nM  r-lib6k>2 
cleaved  by 
13  nM  d-614 


200 


250 


300 


hour 


Figure  3-32.  Single-turnover  kinetics  for  d-614  cleaving  r-primer  or  r-lib6lo2 
below  saturation. 
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Figure  3-33.  Multi-turnover  kinetics  for  d-614  cleaving  r-lib61o2  is  not  affected 
by  the  slow  cool  folding  protocol,  and  indicates  two  turnovers  in  200  hours.  (A) 

Percent  of  r-lib61o2  substrate  cleaved  by  d-614.  (B)  The  absolute  amount  of  substrate 
cleaved.  The  enzyme  and  substrate  were  mixed  together  either  immediately  before  or 
immediately  after  the  slow  cool  protocol. 


109 


A 

400 


300 


200- 


o 

3 

■o 

O 

i_ 

a 

I 

c 
o 

ca 


c 

o   100 

c 
o 

i 


fc_ 


100 


200 


300 


400 


hours 


400  nM  r-primer  + 
100nMd-614,  mixed 
before  slow  cool 

400  nM  r-primer  + 
100nMd-614,  mixed 
after  slow  cool 
133  nM  r-primer  + 
33  nM  d-614,  mixed 
before  slow  cool 

133  nM  r-primer  + 
33  nM  d-614,  mixed 
after  slow  cool 


500 


! 
l 

o 

o 

u 


B 

65 

60 

55- 

50- 

45- 

40 

35 

30 

25- 

20- 

15 

10 


■  * 


-i 1— 

10        20 


30 


40        50 

hours 


60 


400  nM  r-primer  + 
100  nM  d-614,  mixed 
before  slow  cool 

400  nM  r-primer  + 
100  nM  d-614,  mixed 
after  slow  cool 

133  nM  r-primer  + 
33  nM  d-614,  mixed 
before  slow  cool 

133  nM  r-primer  + 
33  nM  d-614,  mixed 
after  slow  cool 


70        80        90       100 


Figure  3-34.  Multi-turnover  kinetics  for  d-614  cleaving  r-primer  is  not  affected 
by  the  slow  cool  folding  protocol,  and  indicates  two  turnovers  in  100  hours.  (A) 

Full  time  course  for  r-primer  cleavage  by  d-614.  (B)  Initial  time  course  for  r-primer 
cleavage  by  d-614  reveals  linear  cleavage  up  to  50  hours.  The  enzyme  and  substrate 
were  mixed  together  either  immediately  before  or  immediately  after  the  slow  cool 
protocol. 
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Figure  3-35.  Multi-turnover  kinetics  for  d-614  cleaving  r-primer  suggest  the  first 
cleavage.  (A)  Full  time  course  for  r-primer  cleavage  by  d-614.  (B)  Initial  time  course 
for  r-primer  cleavage  by  d-614  reveals  linear  cleavage  up  to  50  hours.  The  enzyme 
and  substrate  were  mixed  together  either  immediately  before  or  immediately  after  the 
slow  cool  protocol. 
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Figure  3-36.  Intra-molecular  folding  of  r-614  is  reduced  by  the  omission  of  slow 
cooling.    Ribose-614  was  incubated  at  sufficiently  low  concentration  (2  nM  )  as  to 
favor  c/.9-cleavage  with  and  without  the  slow  cool  folding  protocol. 
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Figure  3-37.  Inter-molecular  folding  of  r-614  is  not  affected  by  "slow  cooling. 

Ribose-614  was  incubated  at  sufficiently  high  concentration  (200  nM  )  as  to  favor 
/ram-cleavage  with  and  without  the  "slow  cool"  folding. 


112 


40 


30 

■o 
■ 

I 

o 

£   20 

I 

B 

E 

10H 


o4 


♦ 

1 

♦ 

1 

400nM  r-lib6lo2  +  100  nM  d-614 

♦  mix,  then  slowcooi  (15C) 

*  mix,  then  slowcooi  (25C) 

■    slowcooi,  then  mix  (25C) 
»    mix  no  slowcooi  (25C) 

♦ 

■ 

i 

♦ 

i 

r 1 1 1 1 

10  20  30  40  50 

hour 


Figure  3-38.  Inter-molecular  folding  between  r-614  and  r-lib61o2  is  not  affected 
by  "slow  cooling."    A  four- fold  excess  of  substrate  r-lib61o2  was  mixed  with  d-614 
either  before  or  after  the  slow  cool  folding  protocol.  In  a  third  condition,  samples 
were  mixed  but  were  not  denatured  and  folded  using  the  slowcooi  protocol.  The  "mix, 
then  slow  cool"  condition  was  performed  in  parallel,  with  one  condition  at  15°C  while 
all  other  samples  were  incubated  at  25 °C.  The  slow  cool  folding  protocol  had  not 
effect  on  cleavage  rates.  Incubating  at  the  samples  at  1 5°C  initially  increased  substrate 
cleavage  compared  to  the  same  condition  at  25  °C. 
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Figure  3-39.  7>a/ts-cleavage  of  r-614  is  reduced  by  the  addition  of  cold  chase, 
indicating  that  the  rate  of  E»S  disassociation  is  faster  than  the  chemical  step  of 
cleavage.    Various  substrates  were  incubated  with  excess  enzyme  (614).  Following 
various  incubation  times,  excess  cold  chase  was  added  to  the  reaction.  The  addition  of 
chase  greatly  reduced  cleavage  of  the  substrate  (as  compared  to  an  identical  sample 
without  added  chase)  regardless  of  the  time  at  which  the  chase  was  added. 
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Figure  3-40.  The  residual  cleavage  following  the  addition  of  the  chase  is  less  than 
614  c«-cleavage,  and  is  dependent  on  the  amount  of  chase  added.  The  amount  of 
cleavage  following  the  addition  of  chase  was  re-plotted  by  subtracting  the  time  at 
which  the  chase  was  added  and  the  amount  of  cleavage  at  the  time  chase  was  added 
from  all  time/cleavage  data  point  following  the  chase.  Increasing  the  amount  of  chase 
added  to  the  r-primer(4  nM)/d-6 14(2025  nM)  condition  from  5  uM  to  10  uM 
decreased  the  residual  r-primer  cleavage. 
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Figure  41.  Ribose-614  fra/w-cleavage  is  increased  by  lowering  the  temperature. 

(A)  Full  time  course.  (B)  Plot  of  initial  rates  vs.  temperature.  Ribose-614  (200  nM) 
was  incubated  at  various  temperatures  from  4°C  to  42°C.  Temperatures  below  the 
selection  temperature  of  25  °C  increased  the  rate  of  cleavage,  indicating  that  the  rate 
limiting  step  is  the  folding/association  step  rather  than  the  chemical  step. 
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Figure  3-42.  Lowering  the  incubation  temperature  does  not  reduce  the  rate  of 
614  cleavage  acting  in  either  cis  or  trans.  (A)  Low  concentration  of  r-614  (2  nM, 
favoring  c/s-cleavage)  was  incubated  at  either  4°C  or  25 *C.  (B)  Low  concentration  of 
r-lib61o2  substrate  was  incubated  with  d-614  (100  nM)  at  either  15°C  or  25°C. 
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Figure  3-44.  Nucleotides  86-90  of  614  base-pair  with  nucleotides  22-26  of  either 
itself  (in  cis),  or  with  a  substrate  (in  trans).    .Mutations  were  introduced  into 
nucleotides  86,  88,  and  90  of  614,  resulting  in  reduced  cis  and  trans  cleavage  rates. 
Introducing  the  complementary  mutations  into  nucleotides  22,  24,  and  26  of  the 
substrate  improves  substrate  cleavage  both  in  cis  and  trans. 
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Figure  3-45.  Compensatory  mutations  predicted  by  various  structures  do  not 
improve  catalysis  of  614  mutated  at  nucleotide  55,  65,  or  72. 

Several  single  nucleotide  mutations  reduced  cleavage  rates.  Attempts  to  improve 
cleavage  rates  by  making  compensatory  mutations  predicted  by  several  structural 
models  were  unsuccessful. 
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Figure  3-46.  Several  structural  models  for  r-614  cleaving  r-primer  in  trans.    The 

two  experimentally  supported  trans-binding  structures  are  shown  in  colored  boxes. 
The  red  box  highlights  the  interaction  between  the  35CATG38  region  of  the  primer 
substrate  (S)  and  the  same  sequence  motif  of  614  (E).  The  yellow  box  highlights  the 
interaction  between  17CGACTCACTAT27  of  the  primer  substrate  and 
GTAGTGACG    of  6 1 4.  The  red  arrow  marks  the  ribose-A  site  of  cleavage. 
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Figure  3-47.  The  G38A  mutation  of  d-614  is  partially  "rescued"  by  the 
compensatory  C35T  mutation  in  the  r-lib61o2  substrate,  substantiating  the 
predicted  inter-molecular  "CATC  association.    Converting  G38  of  d-614  to  A 
greatly  reduced  /ra/w-cleavage  of  r-lib61o2,  but  /rans-cleavage  was  partially  recovered 
by  adding  the  compensatory  C35T  mutation  to  r-lib61o2.  Concentrations  used  were  d- 
614  (100  nM),  and  r-lib61o2  (1.5  nM). 
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Figure  3-48.  Structural  probing  using  DMS  and  potassium  permanganate  to 
chemically  modify  single-stranded  DNA  shows  some  protected  positions.  End- 
labeled  d-614  (300  nM)  was  incubated,  with  (lane  3  and  4)  or  without  (lane  1  and  2) 
unlabeled  r-primer  substrate  (900  nM)  overnight  in  reaction  buffer  to  allow  for  folding 
to  reach  equilibrium.  Samples  were  then  treated  with  either  DMS  or  potassium 
permanganate  for  2  minutes  or  5  minutes  to  modify  either  single-stranded  G-  or  T- 
nucleotides,  respectively.  Samples  were  then  ethanol  precipitated  and  treated  with  1M 
piperidine  for  1  hour  at  80°C.  Samples  were  again  ethanol  precipitated  resuspensded 
in  formamide  loading  buffer,  and  run  on  a  10%  PAGE/urea  gel  next  to  a  10-bp  ladder 
(lane  L).  Numbers  indicate  the  positions  of  G's  or  T's  in  the  complete  sequence 
(between  nucleotide  10  and  90).  Positions  that  demonstrated  some  degree  of 
protection  from  chemical  modification  are  indicated  with  the  red  box. 


CHAPTER  4 
STUDIES  OF  IN  VITRO  SELECTIONS 

In  Vitro  Selection  of  Functionalized  and  Non-Functionalized  Libraries  Using 
Caged-Ribose  and  Liquid-Phase  Selection 


In  Vitro  Selection 

Fifteen  in  vitro  selections  were  performed  in  parallel,  each  with  a  variation  in 
either  the  primer  sequence  ("ang"  vs.  "BJ"),  ribose  protection  ("caged"  vs.  unprotected 
free  ribose),  metal  cofactor  in  the  buffer  (none,  zinc,  or  magnesium),  the  number  of 
molecules  sampled  in  the  starting  library  (5  x  1012  vs.  5  x  1013),  or  the  presence  of 
functional  group  on  a  nucleic  acid  (E-base  vs.  standard  T-base).  The  selection  was 
performed  as  described  in  Materials  and  Methods.  Only  one  condition,  standard 
nucleotides  using  "ang  +  ribose"  primer  (with  unprotected-ribose)  generated  active 
DNAzymes  capable  of  phosphodiester  cleavage. 

For  the  initial  round  of  selection,  six  starting  libraries  were  generated  (varying  in 
catalytic  primer  sequence,  ribose  protection,  and  nucleotides)  via  a  single  cycle  of  primer 
extension  (internally  labeled  using  alpha-32P-CTP).  These  libraries  were  divided  into  the 
fifteen  selection  conditions  varying  in  buffer  composition  and  selection  time  (see  Table 
2-1).  All  samples  were  then  exposed  to  150-200  pulses  of  laser  (80-100  mJ/pulse,  the 
number  of  laser  pulses  was  varied  with  laser  power  so  that  the  total  energy  was  held 
constant  at  15,000  ml),  effectively  de-protecting  samples  containing  caged-ribose.  Pools 
were  then  allowed  to  react  for  20,  120,  or  720  minutes  (see  Table  1),  at  which  point  they 
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were  stopped  by  the  addition  of  formamide  stop  dye  and  freezing  (-20°C).  Aliquots  were 
also  removed  from  each  pool  and  stopped  at  various  times  (from  2  to  1 70  hours)  to  assess 
the  progress  curve  for  cleavage  at  each  round  of  selection.  The  majority  of  each  pool  was 
run  on  a  thick  sequencing  gel  and  the  cleaved  fraction  of  the  pool  was  excised  and 
purified.  The  purified  "survivors"  of  each  round  were  amplified  with  PCR  and  subjected 
to  an  additional  round  of  selection.  This  procedure  was  repeated  for  a  total  of  eight 
rounds  of  selection. 

Figure  4-1  shows  the  percent  of  full-length  substrate  for  each  selection  condition 
that  was  converted  to  the  cleaved  product  at  the  time  each  selection  was  stopped  at  each 
round  of  selection.  The  first  selection  is  termed  "round  zero"  because  it  is  the  selection 
of  the  initial  random  library  and  has  therefore  not  undergone  any  rounds  of  selection.  As 
expected  from  a  random  library,  inactive  molecules  dominate  the  library  in  the  early 
rounds  of  selection,  and  cleaved  substrate  is  therefore  nearly  undetectable.  Of  particular 
interest  are  samples  E  and  G.  Sample  E  contained  ten  times  the  starting  material  as  other 
samples,  and  sample  G  contained  more  radiolabel  than  other  samples,  and  for  these 
reasons,  both  samples  have  a  higher  signal-to-noise  ratio.  Both  samples  contained  the 
"BJ  +  cage"  primer  with  standard  nucleotides,  but  the  primer  in  sample  G  was  de- 
protected  by  laser  prior  to  the  initiation  of  the  IVS  experiments  (and  was  therefore 
effectively  "BJ  +  ribose"),  whereas  sample  E  was  protected  until  exposure  to  laser.  Both 
samples  were  exposed  to  laser  at  the  initiation  of  each  round  of  selection.  Both  samples 
were  allowed  to  react  for  two  hours  prior  to  the  addition  of  stop  buffer,  and  both  showed 
low  levels  of  conversion  to  the  cleaved  product.  This  low  level  of  cleavage  (less  than 
1%)  suggests  a  low  level  for  laser  damage. 
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Although  cleaved  products  were  undetectable,  samples  were  run  on  a  PAGE/urea 
gel  and  the  size  corresponding  to  cleaved  product  excised.  These  gel-purified  fragments 
were  used  as  templates  to  re-generate  full  length  DNAzymes  for  the  next  round  of 
selection  (indicating  that  small  amounts  of  cleaved  product  had  in  fact  been  produced  and 
isolated).  More  radiolabel  was  used  for  each  sample  in  all  subsequent  rounds,  increasing 
the  signal  to  noise  ratio.  Samples  A  -  F  all  show  a  noticeable  increase  in  percent  of  full 
length  sample  converted  to  cleaved  product  (-3-4%)  following  just  one  round  of 
selection.  All  these  samples  contained  the  "BJ  +  cage"  primer  with  standard  nucleotides, 
and  varied  only  in  either  the  buffer  composition  or  the  selection  time.  Other  selection 
conditions,  including  standard  nucleotides  with  "BJ  +  de-protected  cage"  or  "ang  + 
ribose"  primers,  and  all  samples  containing  E-base,  had  undetectable  levels  of  cleaved 
product.  After  the  completion  of  two  rounds  of  selection,  however,  selection  conditions 
containing  E-base  with  "BJ  +  cage"  primer  (samples  I  -  M)  began  to  show  2-4%  cleaved 
product;  parallel  selection  conditions  containing  standard  nucleotides  had  slightly  higher 
levels  of  cleaved  product,  -5%  (this  slight  increase  is  possibly  the  result  of  less 
"smearing"  of  the  samples  containing  standard  nucleotides).  Selection  conditions  with 
de-protected  caged-ribose  and  uncaged-ribose,  with  either  standard  or  non-standard 
bases,  still  showed  no  significant  cleavage  (G,  H,  N,  O). 

The  percent  of  full  length  substrate  converted  to  cleaved  product  for  each  sample 
A  -F  (containing  "BJ  +  cage"  with  standard  bases,  regardless  of  the  buffer  composition 
and  selection  time)  continued  to  increase  slightly  with  each  progressive  round  of 
selection.  By  the  completion  of  7  rounds  of  selection,  as  much  as  1 1%  of  these  samples 
were  converted  to  full-length  product.  During  the  same  time,  selection  conditions  with 
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the  "BJ  +  de-protected  cage"  primer  (with  standard  and  non-standard  bases)  continued  to 
have  insignificant  conversion  to  cleaved  product. 

Selection  condition  "H,"  containing  standard  nucleotides  with  the  "ang  +  ribose" 
primer,  began  to  show  3.7%  cleavage  after  four  rounds  of  selection.  The  amount  of 
cleaved  product  for  selection  condition  "H"  increased  dramatically  from  3.7%  after  4 
rounds  of  selection,  to  8%  after  5  rounds,  and  to  24.3%  after  seven  rounds  of  selection. 
The  parallel  selection  to  population  "H"  in  which  E-base  was  substituted  for  T-base  had 
no  detectable  cleavage  even  after  seven  rounds  of  selection. 

The  results  reported  above  tell  us  only  what  fraction  of  the  full  length  material 
was  cleaved  at  the  time  each  round  of  selection  was  stopped.  A  more  detailed 
understanding  of  the  progress  of  each  selection  can  be  gained  by  examining  the  progress 
curve  for  cleavage.  This  was  done  by  removing  an  aliquot  of  each  selection  condition  at 
various  times  (from  2  hours  to  1 70  hours)  and  stopping  the  reaction  by  formamide  stop 
dye.  A  low  signal-to-noise  ratio  made  base-line  determination  difficult,  and  therefore 
estimation  of  the  percent  cleaved  difficult.  Nonetheless,  the  presence  of  DNAzyme 
enrichment  in  our  library  should  be  manifest  by  in  an  increase  in  cleaved  product  with 
time,  whereas  noise  should  remain  relatively  constant  over  time. 

Of  the  fifteen  selection  conditions,  only  selection  condition  "H"  showed  a 
consistent  increase  in  cleaved  product  with  time.  Other  selection  conditions  had  varying 
levels  of  cleavage,  but  cleavage  did  not  consistently  increase  with  time.  It  is  notable  that 
cleavage  was  not  detected  in  samples  G,  N,  and  O,  all  of  which  had  an  unprotected  ribose 
at  the  time  of  laser  exposure  (either  "BJ  +  de-protected  cage"  or  "ang  +  ribose"  primer). 
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Testing  Kinetics  from  Round  8  Pools  Without  Caged-ribose 

These  data  suggest  that  the  laser  may  cause  background  cleavage  that  masks  the 
cleavage  resulting  from  DNAzyme  catalysis.  This  was  tested  by  amplifying  the  survivors 
of  the  eighth  round  of  selection  for  each  selection  condition  without  a  caged-ribose  (using 
either  "BJ  +  ribose"  or  "ang  +  ribose")  and  testing  kinetics  without  the  laser  initiation. 
Figure  4-2  shows  the  results.  The  pool  of  molecules  generated  from  population  "H" 
demonstrates  a  dramatic  increase  in  cleavage  with  increasing  time.  None  of  the  other 
selection  conditions,  however,  show  any  detectable  cleavage.    This  suggests  that  the 
minimal  cleavage  seen  in  the  kinetics  profiles  completed  during  the  IVS  is  in  fact 
background  caused  by  the  combined  presence  of  the  caged-ribose  and  laser. 

Testing  for  Inadvertent  Selection  for  Susceptibility  to  Laser 

In  light  of  the  slight  increase  in  the  amount  of  cleaved  product  observed  with 
increasing  rounds  of  selection  for  all  samples  containing  a  caged  ribose,  it  is  conceivable 
that  the  IVS  with  caged  ribose  and  laser  actually  selected  for  molecules  that  are  more 
susceptible  to  cleavage  by  laser  in  the  presence  of  a  caged-ribose.  To  test  this  possibility, 
pools  from  round  1 ,  4  and  8  for  selection  conditions  "D"  and  "H"  were  re-amplified  with 
caged-ribose  (using  "BJ  +  cage"  and  "ang  +  cage"  primers  respectively).  Each  sample 
was  diluted  to  15  nM  and  exposed  to  laser.  Each  of  the  six  samples  had  less  than  0.25% 
cleavage  prior  to  laser.  Immediately  following  laser  exposure,  pools  from  "H"  rounds  1, 
4,  and  8  showed  3.4,  3.5,  and  3.0%  cleavage;  pools  from  "D"  rounds  1,  4,  8  showed  5.6, 
5.4,  and  5.3%  cleavage.  The  amount  of  each  pool  cleaved  immediately  following  laser 
did  not  increase  from  round  1  to  4  to  8,  arguing  against  the  possibility  that  there  was 
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selection  for  molecules  that  are  more  susceptible  to  laser  induced  cleavage.  When  the 
kinetics  of  these  samples  was  examined  with  increasing  time,  cleavage  of  "H  round  1" 
did  not  increase  significantly,  whereas  "H  round  4"  and  "H  round  8"  showed  increasing 
cleavage  similar  to  samples  prepared  with  "ang  +  ribose"  and  without  laser  initiation 
(Figure  4-3).  Pools  from  "D"  rounds  1,  4,  and  8  did  not  show  significant  increase  in 
cleavage  with  time  after  the  immediate  increase  after  laser  exposure. 

Caged-ribose  Does  Not  Induce  Cleavage  in  trans 

The  results  described  in  the  previous  sections  suggest  that,  although  the  laser 
alone  does  not  cause  significant  cleavage  of  ribose-614,  it  appears  as  though  the  presence 
of  a  caged-ribose  in  addition  to  laser  causes  a  low  level  of  sequence-independent 
cleavage  at  the  caged-ribose  (no  mechanism  for  this  reaction  is  known).  It  remains 
unclear,  however,  if  the  combination  of  laser  and  caged-ribose  is  causing  strand  breakage 
of  the  strand  to  which  the  caged-ribose  is  attached  (in  cis),  or  whether  cleavage  is  in 
trans.  We  tested  this  possibility  by  mixing  unlabeled  caged-ribose  primer  with 
internally-labeled  ribose-lib61o2,  and  then  exposing  the  combined  sample  to  laser.  The 
immediate  cleavage  seen  previously  when  the  caged-ribose  was  internal  to  the  labeled 
strand  was  not  detected  when  the  caged-ribose  was  in  trans  to  the  labeled  ribose- 
substrate,  demonstrating  that  the  cleavage  effect  of  caged-ribose  and  laser  is  localized  in 
cis  to  its  own  phosphodiester  backbone. 

Kinetic  Analysis  of  Rounds  1-8  Pools  for  "H:  ang  +  ribose" 

Pools  for  "H"  were  re-amplified  from  the  stock  of  "survivors"  following  each 
round  of  selection  and  used  for  a  more  detailed  kinetic  analysis.  Samples  were 
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synthesized  with  an  unprotected  ribose,  and  therefore  laser  was  not  used  to  initiate 
cleavage.  Pools  from  each  of  the  eight  rounds  of  selection  were  prepared  at  three 
concentrations:  3  nM,  15  nM  (approximately  the  concentration  used  during  the  IVS),  and 
60  nM.  Figure  4-4  shows  the  full  kinetic  profile  for  the  pool  of  molecules  following  each 
round  of  selection.  From  this  it  can  be  seen  there  is  no  detectable  activity  in  the  pool  of 
molecules  following  only  one  round  of  selection.  A  slight  increase  in  pool  activity  is 
seen  following  only  the  second  round  of  selection,  and  this  activity  dramatically  increases 
progressively  after  third,  fourth,  and  fifth  round  of  selection.  By  the  completion  of  the 
fifth  round  of  selection,  the  activity  of  the  pool  had  reached  an  apparent  maximum,  after 
which  further  selection  no  longer  increased  the  rate  of  product  formation  from  the  pool. 

Figure  4-5  shows  the  initial  rates  of  cleavage  for  each  of  the  three  dilutions.  In  all 
cases,  a  slight  increase  in  cleavage  was  seen  with  each  successive  increase  in 
concentration.  This  indicates  that  cleavage  rate  of  these  pools  is  occurring  in  trans,  but 
the  amount  of  trans  cleavage  must  be  low  as  the  rate  increases  no  more  than  3  fold  when 
the  concentration  increases  20  fold. 

Pools  from  "H  round  8"  where  tested  over  a  temperature  range  to  determine 
whether  the  rate  limiting  step  was  temperature  dependent.  Figure  4-6  shows  that 
cleavage  rates  are  unchanged  by  a  drop  of  temperature  from  25°C  to  15°C.  Further 
lowering  the  temperatures  lowers  the  rate  of  product  formation,  although  less  than 
expected  for  a  reaction  which  is  rate-limited  by  the  chemical  step.  Like  614,  the  rate  of 
cleavage  for  the  pool  of  molecules  from  selection  H  following  8  rounds  of  selection 
appears  to  reflect  a  balance  between  the  association  step  and  than  the  chemical  step. 
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7ra«s-cleavage  was  tested  directly  by  incubating  5 '-labeled  "ang  +  ribose"  primer 
with  the  pool  derived  from  "H  round  7"  (Figure  4-7).  The  presence  of  equimolar  "ang  + 
ribose"  primer  substrate  slightly  reduced  cleavage  of  molecules  in  the  "H.Round#7"  pool, 
while  resulting  in  nearly  75%  primer  cleavage.  Nine-fold  excess  of  "ang  +  ribose"  over 
"H"  pool  reduced  the  cleavage  of  molecules  in  the  "H"  pool  by  half,  while  resulting  in 
five  times  more  primer  cleavage  (the  percent  of  primer  cleaved  was  reduced  in  half,  but 
since  the  total  amount  of  primer  present  was  9-fold  higher,  the  net  result  is  an  increased 
amount  of  product  formation). 

Selection  condition  "H"  contained  1  M  NaCl  and  1  mM  MgCl2  in  the  reaction 
buffer.  It  was  assumed  by  Breaker  and  Joyce,  and  most  subsequent  researchers,  that  the 
molecules  generated  using  IVS  with  MgCl2  in  the  buffer  would  utilize  the  magnesium  as 
a  cofactor  (it  was  never  overtly  stated,  but  it  is  possible  molecules  that  are  Mg- 
independent  would  be  lost  during  the  work-up  phase  of  selection  and  therefore  selected 
against).  Following  this  assumption,  molecules  would  remain  inactive  until  the 
magnesium  cofactor  was  added.  Our  experience  with  614  demonstrated  that  molecules 
selected  with  magnesium  as  a  cofactor  can  in  fact  be  magnesium-independent. 

Pools  generated  from  "H  round  7"  showed  similar  kinetics  regardless  of  whether 
Zn  was  substituted  for  magnesium,  or  in  fact  if  no  divalent  cation  was  present  (Figure 
4-8A).  To  remove  the  possibility  that  trace  metal  contamination  contributed  to  catalysis 
in  the  absence  of  magnesium  or  zinc  in  the  reaction  buffer,  new  buffers  were  made  with 
double-distilled  water  and  ultra-pure  NaCl.  Cleavage  was  unchanged  by  the  removal  of 
MgCl2,  the  addition  of  EDTA,  or  the  addition  of  DEPC  (used  in  previous  buffers)  to  the 
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reaction  buffer.  No  catalysis  was  observed,  however,  when  the  NaCl  was  omitted  from 
the  reaction  buffer,  even  with  the  addition  of  1  mM  MgC^  (Figure  4-8B). 

Sequence  Evolution  During  IVS 

The  kinetic  profiles  of  pool  "H"  show  a  progressive  increase  in  catalytic  power 
following  the  third,  fourth,  and  fifth  rounds  of  selection,  with  no  further  increase 
following  subsequent  rounds  of  selection.  The  evolution  of  the  populations  during  the 
selections  was  examined  by  following  the  progression  of  sequence  variation.  Individual 
species  from  pools  of  survivors  following  rounds  1,  3,  4,  5,  and  8  of  selection  were 
cloned  and  sequenced. 

None  of  the  eighteen  clones  sequenced  from  the  round  one  pool  were  similar  to 
each  other,  and  only  one  was  similar  to  sequences  isolated  from  later  rounds.  In  contrast, 
most  of  the  sequences  isolated  from  later  rounds  of  selection  (pools  following  rounds  3, 
4,  5,  and  8)  fell  into  four  Families,  termed  alpha,  beta,  gamma,  and  delta  (Figure  4-9). 
Families  beta,  gamma,  and  delta  share  two  common  sequence  motifs,  both  in  the  3'  end 
oftheN40  region. 

The  high  degree  of  sequence  variation  seen  in  the  "round  one  pool"  is  expected, 
and,  with  the  observation  that  this  pool  shows  very  little  catalytic  activity,  is  consistent 
with  the  idea  that  "survivors"  in  this  pool  are  dominated  by  non-catalytic  molecules  that 
"leaked"  through  the  selection  system  (either  by  nonspecific  cleavage,  trans-cleavage,  or 
abnormally  fast  migration  during  gel  electrophoresis). 

As  catalytic  activity  of  the  pool  begins  to  increase  following  the  third  round  of 
selection,  examples  of  the  sequence  Family  alpha  and  beta  begin  to  dominate.    Examples 
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of  the  Family  gamma  and  delta  sequences  begin  to  show  up  following  the  fourth  round  of 
selection.  All  representatives  of  the  Family  alpha  are  eliminated  from  our  sampling  in 
rounds  five  and  subsequent  rounds,  and  all  of  the  representatives  of  the  delta  Family  are 
eliminated  by  the  eighth  round  of  selection.  The  extinction  of  Family  alpha  and  Family 
delta  may  suggest  that  these  sequence  Families  have  low  catalytic  activity  compared  to 
other  sequences  with  which  they  are  in  competition  (or  perhaps  Families  went  extinct 
because  they  were  poor  substrates  for  PCR). 

Although  sequences  clearly  cluster  into  Families,  few  of  the  sequences  are  exactly 
identical  to  any  other  sequence.  This  similarity  of  sequences  within  a  Family  can  either 
be  the  result  of  convergence  onto  similar  sequences,  assuming  each  sequence  was 
originally  present  in  the  initial  random  library,  or  through  divergence  from  a  single 
sequence  via  mutations  accrued  during  PCR  amplification. 

The  sequences  of  Family  beta,  gamma,  and  delta  are  also  very  similar  to 
sequences  614,  615,  and  616  (respectively)  isolated  from  a  previous  selection  experiment. 
The  infrequency  of  these  large  "motifs"  (1  in  1021)  makes  it  extremely  unlikely  that  such 
motifs  would  be  sampled  from  a  random  library  (containing  1024  potential  variants) 
multiple  times  (either  within  a  single  experiment  or  in  two  separate  experiments)  when 
each  sample  contains  only  1012  molecules.  Several  possibilities  could  explain  this  result. 

(a)  The  library  is  not  truly  random:  If,  rather  than  containing  the  expected  1024 
unique  molecules,  a  bias  in  the  synthesis  or  purification  of  the  library  greatly 
reduces  the  actual  library  size  to  (for  example)  only  1013  different  molecules,  than 
the  odds  of  recovering  the  same  sequence  in  each  of  two  independent  samplings 
of  1012  molecules  is  relatively  high. 

(b)  PCR  generated  bias.  Creating  the  full-length  product  requires  that  only  40 
nucleotides  be  synthesized  between  the  two  primers.  Given  that  one  primer  is  49- 
nucleotides  long,  and  the  other  1 8-nucleotides  long,  it  is  conceivable  (although 
not  likely)  that  the  primers  may  mis-prime  off  of  either  the  randomized  template 
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or  off  of  one  another  to  generate  products  of  approximately  100  nucleotides. 
Successive  rounds  of  PCR  and  selection  based  on  size  and  catalytic  ability  may 
isolate  molecules  that  were  never  present  in  the  initial  sample. 

(c)  Contamination  from  an  external  source.  DNA  from  environmental  sources  (such 
as  pollen,  microbes,  or  the  humans  conducting  the  experiments)  may  have 
contaminated  both  IVS  experiments. 

(d)  Contamination  from  an  internal  source.  Catalytic  sequences  isolated  from  the 
previous  IVS  may  have  contaminated  equipment  or  buffers,  therefore  biasing  the 
library  of  the  second  IVS. 

Because  it  is  difficult  to  imagine  a  mechanism  that  would  result  in  either  scenario  (a)  or 

(b),  contamination  seems  to  be  the  most  likely  explanation  for  finding  extremely  rare 

catalytic  sequences.  A  few  factors,  however,  argue  against  contamination.  First,  the 

second  IVS  was  performed  by  different  researchers,  at  different  times,  in  different 

buildings,  with  different  equipment.  Second,  our  sequencing  studies  of  614  revealed  that 

mutations  occurred  very  infrequently  (less  than  1  out  of  8  sequences  contained  a 

mutation)  after  20  cycles  of  PCR.  If  the  Family  beta  sequences  were  derived  from 

contamination  by  614,  one  would  expect  to  find  many  exact  copies  of  614,  particularly  in 

early  rounds  of  the  selection.  Yet  of  the  12  Family  beta  clones,  eleven  differed  from  614 

by  at  least  one  nucleotide.  Finally,  the  event  that  led  to  contamination  is  itself  rare;  the 

chance  of  it  occurring  three  separate  times  (one  for  each  sequence)  is  considerably  more 

unlikely. 

Kinetic  Analysis  of  Round  8  Clones 

Over  half  the  clones  isolated  following  round  8  were  very  similar  to  614  (Family 
beta  and  gamma),  yet  the  rate  constant  for  cleavage  of  the  round  8  pool  was  significantly 
faster  than  614.  The  difference  could  be  because  the  average  rate  of  all  sequence  variants 
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in  pool  8  is  faster  than  that  of  the  614  sequence.  This  was  explored  by  testing  the  rates  of 
c/s-cleavage  of  various  clones  isolated  from  the  round  8  pool  (Figure  4-9).  Cleavage 
assays  were  done  at  12  nM,  the  same  concentration  used  during  the  selections. 

Clone  Hr#8.161  differed  from  614  by  only  one  nucleotide,  yet  the  rate  constant 
was  2-fold  lower  (0.0059  hr"1),  whereas  other  clones  (Hr#8.58,  Hr#8.53,  Hr#8.162) 
differing  from  614  and  each  other  by  only  one  or  two  nucleotides  had  rate  constants  ten- 
fold higher  than  614  (-0.15  hr"1).  Two  Family  gamma  sequences  (Hr#8.56  and 
Hr#8. 169)  differed  from  each  other  by  only  three  nucleotides,  yet  there  rate  constants 
differed  by  an  order  of  magnitude  (0.12  vs.  0.017  hr"1).  The  clones  from  round  8  that  did 
not  fall  into  one  of  the  four  major  sequence  Families  all  had  very  low  rate  constants 
(Hr#8.170  was  the  fastest,  with  a  rate  of  0.0013  hr"1). 

Simulations  of  In  Vitro  Selection  Experiments 
To  understand  the  role  that  catalytic  nucleic  acids  may  have  played  in  the  origins 
of  life,  we  must  be  able  to  quantify  the  power  and  abundance  of  catalytic  activity  in  a 
random  pool  of  nucleic  acid  polymers.  Previous  IVS  experiments  have  supported  the 
expectation  that  catalytic  nucleic  acids  ("catalytic"  in  regards  to  any  specific  reaction)  are 
"rare."  We  are  limited  in  our  ability  to  directly  estimate  the  frequency  of  "rare"  catalysts 
by  limitations  of  experimental  methods  used  to  detect  them.  Detection  of  catalysis  most 
frequently  relies  on  an  electrophoretic  size  shift  of  a  radio-labeled  catalyst-substrate 
conjugate.  Alternatively,  a  sample  of  molecules  could  be  isolated  via  cloning  and 
sequencing,  and  unique  sequences  could  be  tested  individually  for  catalytic  activity 
(within  an  order  of  magnitude,  102  is  largest  practical  sample  size  for  this  method).  If 
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catalysts  were  abundant  in  the  initial  library,  catalysis  could  be  detected  above 
background  by  electrophoretic  size  shift  of  the  library  itself,  or  by  examining  100 
individual  molecules  randomly  selected  from  the  library  via  cloning.  Both  methods, 
however,  could  not  yield  a  positive  result  if  catalysis  were  less  frequent  than 
approximately  one  part  per  hundred.  In  the  first  method,  background  would  obscure  less 
abundant  catalysts;  in  the  second  method,  insufficient  sampling  would  miss  the  catalysts. 
Being  that  catalysts  are  much  more  rare  than  one  per  hundred,  multiple  rounds  of 
selection  must  be  performed  to  enrich  catalysts. 

Most  IVS  experiments  are  designed  to  isolate  the  "best"  catalysts  in  the  library. 
Isolating  a  few  of  the  (presumably)  "best"  catalysts  tells  us  very  little  about  the  likelihood 
of  generating  function  from  a  random  library.  Indeed,  the  question  of  greatest 
importance  is,  "How  many  catalysts  were  there  in  the  initial  pool,  and  what  was  the 
distribution  of  their  catalytic  power  (rate  constants)?"  To  do  this  we  must  extrapolate 
from  what  we  can  observe  following  multiple  rounds  of  selection. 


Simulating  In  Vitro  Selections 


This  section  describes  of  a  "play  forward"  model  for  IVS  which  can  be  used  to 
infer  information  about  an  initial  library  based  on  data  gathered  following  multiple 
rounds  of  selection.  The  model  begins  by  estimating  values  for  key  parameters  for  an 
initial  random  library,  and  then  simulating  multiple  rounds  of  selection.  This  allows  us  to 
determine  the  impact  of  various  initial  conditions  on  observable  outcomes  following 
selection.  The  parameters  of  the  model  are  on  the  following  page. 
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SL        Size  of  initial  library 

RF       Reactive  Fraction 

NRF     Non-reactive  Fraction:  NRF  =  SL*(\  -  RF) 

k„         Intrinsic  rate  of  molecule  "«":  each  member  of  the  initial  library,  n,  is 
assigned  an  intrinsic  rate,  k„,  which  relates  to  its  ability  to  be  cleaved 
under  selection  conditions.  The  intrinsic  rate  is  sequence-dependent  and 
therefore  unchanging. 

kshwest  The  rate  constant  for  the  slowest  catalysts  in  the  library. 

kfastest    The  rate  constant  of  the  fastest  molecule  in  the  library. 

'Alifen  The  intrinsic  rate  of  molecule  "n"  (k„)  expressed  in  terms  of  half-life: 

'Mifen=\n2lkn 
St         Selection  time:  the  amount  of  time  allowed  for  a  molecule  to  cleave  itself 

and  therefore  achieve  "survival." 
m  or  b  Distribution  function  for  reactive  molecules.  Two  functions  have  been 

tested  thus  far,  a  linear  function  with  slope  =  m,  and  an  exponential 

function  with  decay  rate  =  b. 
L  Leakage:  the  fraction  of  the  pool  that  "survives"  a  round  of  selection 

independent  of  St  and  intrinsic  rate  constant. 

Model  of  survival:  first  order  chemical  rate  based  on  a  molecules  intrinsic  rate 
constant  (k„)  and  the  selection  time  (St),  P(cleavage)  =  exp(-£„  *  St) 

During  an  actual  in  vitro  selection,  the  only  data  that  can  (easily)  be  obtained 

experimentally  are  "fraction  cleaved"  vs.  "time"  for  the  entire  pool  at  each  round  of 

selection.  Once  the  progress  curve  is  determined  for  each  round  of  selection,  data  can  be 

extracted  to  make  other  useful  plots  that  illustrate  the  progress  of  the  entire  selection 

process.  One  such  plot  compares  the  fraction  cleaved  at  time  =  t  for  increasing  rounds  of 

selection.  If  "time"  equals  the  "selection  time,"  this  plot  shows  the  amount  of  sample 

cleaved  at  the  completion  of  each  round  of  selection  (the  surviving  fraction).  It  may  also 

be  useful  to  plot  the  fraction  cleaved  at  a  time  less  than  the  selection  time,  particularly 

when  comparing  IVS  experiments  performed  with  various  selection  times,  or  when 

comparing  IVS  experiments  in  which  nearly  100%  of  the  fraction  has  been  cleaved  at  the 

completion  of  each  round  (t  =  St).    From  these  data  we  would  like  to  estimate  the 
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distribution  of  rates  at  each  round,  and  if  possible,  extrapolate  from  these  measurements 
the  distribution  of  rates  in  the  initial  library. 

The  model  begins  by  simulating  an  initial  pool  composed  of  1012  unique 
molecules  (roughly  the  size  of  the  initial  library  screened  in  a  typical  IVS).  This  initial 
pool  is  then  divided  into  a  "Reactive  Fraction"  (RF)  and  a  "Non-reactive  Fraction" 
(NRF).  The  NRF  is  defined  as  the  portion  of  the  library  that  does  not  have  any  intrinsic 
catalytic  activity  above  the  background  cleavage  rate  (for  our  experimental  system,  the 
background  cleavage  rate  is  ~10"7  min"1,  based  on  the  experimentally  determined 
cleavage  rate  of  ribose  by  magnesium  ion  (Breaker  and  Joyce,  1995). 

All  reactive  molecules  in  the  initial  library  (RF)  are  assigned  randomly  a  discrete 
rate  category  (in  this  simulation  2 1  rate  categories  were  used)  between  ksiowest  and  kfastest 
(the  intrinsic  rate  for  each  category  increases  proportionally  between  ksiowes,  and  kfaslest). 
The  lower  limit  for  catalytic  rate  constants,  ksiowesh  is  set  to  100-times  faster  than  the 
background  rate  (this  lower  limit  is  set  both  by  the  methods  of  detecting  a  difference 
between  background  and  signal  and  the  duration  of  a  kinetics  experiment:  we  must  detect 
1%  cleavage  above  background  in  1000  minutes,  corresponding  to  a  lower  limit  of  ~10"5 
min"  ).  The  rate  constant  of  the  fastest  molecule  represented  in  the  initial  library,  kfaslest, 
must  be  determined  experimentally  as  it  is  expected  to  depend  on  the  particular  catalytic 
reaction  and  library  being  tested.  Previous  research  suggests  an  upper  limit  for 
DNAzyme  cleavage  of  a  phosphodiester  bond  in  the  region  of  0.1  min"1.  Unless 
otherwise  stated,  kfastes,  was  therefore  set  to  0.1  min"1. 

Very  little  is  known  about  the  likely  distribution  of  catalytic  rates  in  a  random 
library;  indeed  the  distribution  of  catalytic  abundance  versus  catalytic  rate  and  may  be 
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apparently  chaotic.  If  the  distribution  of  catalytic  function  can  in  fact  be  approximated 
with  a  function,  it  seems  reasonable  that  faster  catalysts  will  be  less  abundant  than  slower 
catalysts,  and  therefore  a  decreasing  function  should  be  used.    The  simulations  described 
here  have  examined  only  two  of  these  functions:  a  linear  and  an  exponential. 

The  final  parameter  considered  in  this  model  is  "leakage,"  defined  as  the  fraction 
of  the  pool  at  each  round  of  selection  that  survives  that  round  of  selection  independent  of 
both  its  intrinsic  rate  (sequence  dependent)  and  the  St.  As  a  first  approximation,  leakage 
can  be  considered  the  background  rate  of  cleavage  by  the  magnesium  ion  at  pH  7. 
However,  experimental  deficiencies  are  likely  to  be  a  more  significant  contributor  to 
leakage  than  the  background,  magnesium-dependent  cleavage.  Leakage  is  therefore 
introduced  to  accommodate  for  deficiencies  of  the  experimental  system,  and  background 
cleavage  is  considered  separately. 

In  selections  using  biotinylated  DNAzymes  bound  to  a  streptavidin  column, 
leakage  results  from  molecules  that  are  eluted  from  the  column  after  the  initiation  of  the 
selection.  This  leakage  may  arise  from  molecules  that  never  were  never  bound  in  the  first 
place  (withheld  in  the  matrix),  molecules  uncleaved  but  released  due  to  incomplete 
streptavidin/biotin  interaction,  or  molecules  cleaved  by  the  NaOH  wash.  IVS  based  on 
size  separation  using  gel  electrophoresis  can  have  leakage  resulting  from  incomplete  size 
separation  such  that  some  gel  extraction  inadvertently  collects  incorrect  size  molecules. 
In  our  IVS  system,  leakage  may  also  be  the  result  of  laser  induced  cleavage. 

Survival  of  any  particular  molecule  is  based  on  its  probability  of  being  cleaved 
given  a  "Selection  time"  (St),  and  is  therefore  independent  of  other  molecules.  The 
probability  of  being  cleaved,  P(cleavage),  is  modeled  assuming  that  each  catalysts 
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approximately  follows  first  order  chemical  kinetics  and  therefore  is  determined  solely  by 
a  molecule's  intrinsic  rate  (k„,  as  assigned  by  the  initial  distribution  function,  either  linear 
or  exponential)  and  time  (t)  following  the  equation  P(cleavage)  =  exp(-k„  *  St). 

Our  previous  analysis  of  614  demonstrated  that  DNAzymes  do  not  necessarily 
follow  first  order  kinetic  models.    7>a«s-cleavage  is  in  fact  possible.  Therefore,  some 
cleavage  is  second  order  and  concentration  dependent.  Does  this  invalidate  our  model 
which  assumes  first  order  kinetics?  Only  slightly.  7>a/zs-cleavage  for  614  was 
significant  only  at  high  concentrations;  the  effective  molarity  of  c/s-cleavage  was 
approximately  25  nM,  suggesting  that  c/s-cleavage  predominates  below  25  nM. 
Therefore,  if  selections  are  performed  at  low  concentrations,  m-cleavage  will 
predominate,  and  our  first  order  kinetic  approximation  is  acceptable.  Future  models  may 
be  improved  to  consider  bi-molecular  reactions.  Alternatively,  IVS  experiments  may  be 
designed  to  eliminate  /ra/zs-cleavage  (perhaps  by  re-engineering  the  binding  motif  in  the 
constant  region  of  the  library). 


Using  Simulations  to  Determine  the  Impact  of  the  Leakage  (L)  Parameter  on  IVS 

Initial  simulations  were  done  to  determine  the  influence  of  factors  such  as  leakage 
(L),  selection  time  (St),  maximum  rate  (kfastesd>  and  the  initial  abundance  of  reactive 
molecules  (RF)  on  the  cleavage  progress  curve  and  population  distribution.  Most  of 
these  initial  simulations  were  done  using  the  simplest  distribution  function,  a  linear 
distribution  with  m  =  0,  but  the  conclusions  can  be  generalized  to  other  distribution 
functions. 
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Figure  4- 1 0  shows  the  results  of  simulations  for  three  test  cases,  each  differing  in 
the  value  of  the  leakage  parameter  (L,  ranging  from  zero  to  4%).  Panel  A  shows  the 
initial  distribution  for  each  population  (the  population  was  distributed  according  to  a 
linear  function  with  m  =  0  for  all  three  test  cases).  Panel  E  plots  experimentally 
observable  data  -  the  progress  curve  for  each  test  case  following  successive  rounds  of 
selection  (the  progress  curve  is  a  plot  of  the  fraction  of  the  population  cleaved  with 
increasing  time).  From  the  progress  curve,  the  amount  of  each  population  cleaved  at  time 
=  selection  time  (St),  or  time  =  8 1  minutes,  was  extracted  for  each  round  and  plotted  in 
Panel  B  and  C.  Although  during  the  experimental  implementation  of  a  selection  we  can 
only  observe  the  progress  curve,  the  simulations  allow  us  to  observe  the  population 
distribution  at  each  round.  These  data  are  plotted  in  Panel  D  (the  intrinsic  rate,  k„,  for 
each  molecule  has  been  converted  into  its  corresponding  half  life,  so  a  rate  of  .00231 
min"1  is  plotted  as  a  'Alife  of  300  minutes). 

Most  of  the  molecules  in  the  initial  library  are  non-reactive,  and  therefore  are 
cleaved  at  the  background  rate  of  only  10"7  min"1.  The  fraction  of  the  population  cleaved 
at  the  end  of  the  first  round  of  selection  (t  =  St)  -  referred  to  as  "survivors"  -  is  therefore 
extremely  low,  and  mostly  a  reflection  of  leakage.  With  each  successive  round  of 
selection,  the  catalytic  molecules  are  enhanced  while  the  non-reactive  fraction  is 
diminished.  Eventually  the  catalytic  molecules  reach  sufficient  abundance  relative  to  the 
non-reactive  molecules  that  they  surpass  the  threshold  of  detection,  defined  as  1%  above 
background  cleavage  (at  time  =  St). 

Although  each  case  shown  in  Figure  4-10  had  the  same  initial  distribution  and 
abundance  of  catalysts,  increasing  the  amount  of  leakage  from  zero  to  0.2%  delayed  first 
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detection  of  survivors  from  round  3  to  round  4.  Increasing  the  leakage  parameter  even 
further  to  4%  delayed  the  earliest  detection  of  survivors  to  round  7.  This  indicates  that 
the  leakage  parameter  plays  a  significant  role  in  determining  the  number  of  rounds  of 
selection  required  to  achieve  detectable  levels  of  catalysis. 

With  sufficient  rounds  of  selection,  each  test  case  eventually  reaches  a 
"survivorship  plateau"  -  a  point  beyond  which  further  rounds  of  selection  no  longer 
significantly  increases  the  fraction  of  the  population  surviving  each  round  (Panel  B). 
Panel  C  shows  a  similar  plateau  for  the  amount  cleaved  at  t  =  8 1  minutes  (it  is  useful  to 
plot  the  amount  cleaved  at  a  set  time  for  conditions  where  populations  are  selected  with 
various  selection  times).  In  this  simulation,  each  test  case  reaches  a  survivorship  plateau 
near  100%  cleaved.  Panel  D  shows  the  distribution  of  catalysts  for  each  test  case  at 
successive  rounds  of  selection.  Comparing  Panel  B  (or  C)  with  Panel  D  illustrates  that  at 
the  same  round  the  survivorship  begins  to  plateau,  the  distribution  of  catalysts  in  the 
selected  populations  reach  a  stable  state  -  referred  to  as  the  "stabilized  population"  - 
where  the  proportionality  between  catalytic  rates  of  surviving  molecules  remains  constant 
even  with  additional  rounds  of  selection.  Once  this  survivorship  reaches  a  plateau,  there 
is  no  selective  enhancement  of  faster  molecules  over  the  remaining  slower  molecules, 
resulting  in  the  stabilization  of  the  surviving  population. 

Case  1  achieves  a  survivorship  plateau  of  100%  by  round  4  (panel  B),  and  panel 
D  shows  that  the  population  distribution  for  case  1  remains  relatively  constant  from 
round  4  through  the  end  of  the  simulation  (round  9).  Case  2  and  case  3  achieve  a 
survivorship  plateau  of  100%  much  later  than  case  1  (round  6  and  9,  respectively)  as  a 
result  of  a  higher  leakage  parameter  (L  =  0.2%  and  4%,  respectively).  Despite  the 


142 

additional  rounds  required  to  reach  a  survivorship  plateau,  the  population  distribution  for 
all  test  cases  converges  to  the  same  population  distribution  (this  occurs  when  there  is  no 
longer  selective  enhancement  for  faster  molecules  and  corresponds  to  the  survivorship 
plateau).  This  is  manifested  by  the  convergence  of  progress  curves  for  all  test  cases 
(once  each  test  case  reaches  a  survivorship  plateau).  This  demonstrates  that,  although  the 
leakage  parameter  dramatically  affects  the  number  of  rounds  necessary  to  isolate 
catalysts,  it  does  not  alter  the  distribution  of  selected  catalysts  (following  sufficient 
rounds  of  selection). 

The  distribution  of  selected  catalysts  in  the  stabilized  population  bears  an 
interesting  relationship  to  the  selection  time.  There  are  three  distinct  population  zones 
defined  in  relation  to  the  St:  zone  A  is  comprised  of  molecules  with  half-lives  larger 
(slower)  than  St;  zone  B  is  comprised  of  molecules  with  half-lives  between  St  and 
approximately  St/6  (6-fold  faster);  and  zone  C  is  comprised  of  molecules  with  half-lives 
between  St/6  and  kfastest- 

Although  in  this  simulation  all  molecules,  regardless  of  their  rate  constant,  were 
initially  present  in  equal  amounts,  molecules  in  zone  A  survive  with  increasingly  lower 
probability  than  faster  molecules  (zone  B  and  C).  Eventually  the  surviving  population 
reaches  a  point  where  molecules  from  zone  A  are  effectively  absent.  Unless  their  initial 
abundance  is  exceedingly  high  relative  to  faster  molecules,  molecules  of  zone  A  are 
quickly  eliminated  from  the  selected  population. 

In  fact,  given  an  initial  population  consisting  catalysts  with  one  of  only  two 
possible  intrinsic  rates,  one  with  rate  equal  to  St  {'/2lifefast  =  St),  and  the  other  ten  times 
slower  ('/ilife&t  =  10*5/),  the  slower  catalysts  must  be  7.5-fold  more  abundant  than  the 
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faster  catalyst  in  order  for  both  catalysts  to  be  in  equal  abundance  following  one  round  of 
selection.  If  three  round  of  selection  are  required,  the  initial  abundance  of  the  slower 
catalysts  needs  to  be  (7.5)3  =  416  times  more  abundant  than  the  faster  catalyst.  The 
situation  is  even  more  dramatic  for  "very  slow"  catalysts:  if  a  very  slow  catalysts  is  100 
times  slower  ('/2lifeverysiow  =  100*50  than  the  fast  catalyst  ('Alife/as,  =  St),  then  this  very 
slow  catalyst  must  be  in  72-fold  greater  abundance  in  the  initial  pool  to  survive  in  equal 
proportion  as  the  faster  catalyst  following  one  round  of  selection,  and  380,000-fold  (= 
72)  greater  abundance  to  survive  three  rounds  of  selection.  Because  IVS  routinely 
require  5  to  15  rounds  of  selection,  molecules  with  half  lives  greater  than  St  are  likely  to 
be  completely  absent  from  the  population  by  the  time  any  cleavage  above  background  is 
detected.  For  the  simulation  shown  in  Figure  4-10,  all  molecules  in  zone  A  are 
eliminated  by  round  3. 

Zone  B  corresponds  to  an  area  of  active  enrichment  where  molecules  with  faster 
rates  are  enhanced  disproportionably  greater  than  molecules  with  slower  rates.  With 
sufficient  rounds  of  selection,  the  slower  molecules  fall  below  the  level  of  detection  (and 
hence  zone  B  becomes  more  skewed  towards  faster  catalysts  and  zone  A  enlarges  to 
include  an  area  that  was  formerly  part  of  zone  B). 

Zone  C  corresponds  to  the  situation  describe  above  in  which  all  the  molecules  are 
sufficiently  fast  relative  to  St  ('Alife  <  St/6)  such  that  nearly  all  molecules  are  cleaved  by 
the  completion  of  each  round  of  selection  and  no  selective  enhancement  exists. 
Consequently,  molecules  in  zone  C  continue  to  persist  without  any  change  in  their 
relative  proportion  to  each  other.  These  properties  of  the  population  distribution  can  be 
seen  in  Figure  4-10,  or  in  more  detail  in  Table  4-1. 
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TABLE  4-1  A.  Probability  of  survival  following  successive  rounds 
of  selection  (based  on  intrinsic  rate  and  selection  time) 


st  = 

half  life 

min) 

300 

1200 

600 

300 

150 

75 

50 

37.5 

18.75 

9.375 

4.688 

intial  rate  (min'1] 
6E- 

X  round 

04 

0.001 

0.002 

0.005 

0.009 

0.014 

0.018 

0.037 
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0.148 

of  selection: 
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0 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

0.16 

0.29 

0.50 

0.75 

0.94 

0.98 

1.00 

1.00 

1.00 

1.00 

2 

0.03 

0.09 

0.25 

0.56 

0.88 

0.97 

0.99 

1.00 

1.00 

1.00 

3 

0.00 

0.03 

0.13 

0.42 

0.82 

0.95 

0.99 

1.00 

1.00 

1.00 

4 

0.00 

0.01 

0.06 

0.32 

0.77 

0.94 

0.98 

1.00 

1.00 

1.00 

5 

0.00 

0.00 

0.03 

0.24 

0.72 

0.92 

0.98 

1.00 

1.00 

1.00 

6 

0.00 

0.00 

0.02 

0.18 

0.68 

0.91 

0.98 

1.00 

1.00 

1.00 

7 

0.00 

0.00 

0.01 

0.13 

0.64 

0.90 

0.97 

1.00 

1.00 

1.00 

8 

0.00 

0.00 

0.00 

0.10 

0.60 

0.88 

0.97 

1.00 

1.00 

1.00 

9 

0.00 

0.00 

0.00 

0.08 

0.56 

0.87 

0.97 

1.00 

1.00 

1.00 

10 

0.00 

0.00 

0.00 

0.06 

0.52 

0.85 

0.96 

1.00 

1.00 

1.00 

TABLE  4-1 B.  Population  distribution  following  successive 
rounds  of  selection. 


300 


X  round 
of  selection: 
0 


half  life  (min) 
1200        600 
intial  rate  (min'1) 
6E- 

04  0.001  0.002 
percent  of  population 
10.00      10.00      10.00 
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75 


50        37.5      18.75      9.375      4.688 


1 

2.09 

3.84 

6.56 

9.84 

2 

0.37 

1.27 

3.70 

8.32 

3 

0.06 

0.40 

1.97 

6.65 

4 

0.01 

0.12 

1.03 

5.20 

5 

0.00 

0.04 

0.53 

4.02 

6 

0.00 

0.01 

0.27 

3.09 

7 

0.00 

0.00 

0.14 

2.36 

8 

0.00 

0.00 

0.07 

1.80 

9 

0.00 

0.00 

0.04 

1.37 

10 

0.00 

0.00 

0.02 

1.04 

0.005  0.009 
after  X  rounds  of 

10.00  10.00 
12.30 
12.99 
12.99 
12.70 
12.27 
11.79 
11.27 
10.75 
10.23 
9.72 


0.014      0.018 
selection: 

10.00      10.00 


12.92 
14.33 
15.04 
15.44 
15.67 
15.80 
15.86 
15.88 
15.87 
15.83 


13.07 
14.67 
15.58 
16.18 
16.62 
16.96 
17.23 
17.46 
17.65 
17.82 


0.037      0.074      0.148 


10.00 
13.12 
14.78 
15.77 
16.44 
16.95 
17.36 
17.71 
18.01 
18.28 
18.52 


10.00 
13.12 
14.78 
15.77 
16.44 
16.95 
17.36 
17.71 
18.01 
18.28 
18.53 


10.00 
13.12 
14.78 
15.77 
16.44 
16.95 
17.36 
17.71 
18.01 
18.28 
18.53 


=  initial  population  distribution 
zone  A  =  below  level  of  detection  (2%) 

zone  B  =  population  in  flux 

zone  C  =  initial  proportionality  is  unchanged  (within  90%) 
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Examining  the  Impact  of  the  Selection  Time  (St) 

Figure  4-11  shows  the  results  of  another  simulation  in  which  the  selection  time 
(St)  was  varied  (St  =  15,  246,  and  16922  minutes  for  case  1,  2,  and  3  respectively)  while 
all  other  parameters  were  held  constant.  Increasing  the  selection  time  from  15  minutes  to 
246  initially  resulted  in  a  larger  fraction  of  the  population  being  cleaved  at  the  end  of 
each  round  of  selection,  corresponding  to  earlier  initial  detection.  Increasing  the 
selection  time  further  to  16922  minutes  reversed  this  trend,  demonstrating  that  the  round 
at  which  catalysts  are  first  detected  (at  t  =  St)  can  vary  according  to  the  selection  time. 
When  the  selection  time  is  very  short,  very  few  molecules  meet  the  selection 
requirement,  and  thus  more  rounds  are  required  to  isolate  these  infrequent  molecules. 
Extending  the  selection  time  increases  the  number  of  molecules  that  may  survive,  and 
this  may  reduce  the  number  or  rounds  required  to  isolate  catalysts.  However,  as  the 
selection  time  is  increased  further,  the  average  rate  of  the  surviving  population  remains 
low,  and  more  rounds  are  required  to  detect  cleavage  in  the  population. 

As  seen  in  the  simulation  with  varying  leakage  rates,  the  fraction  of  the 
population  cleaved  at  a  given  time  reaches  a  plateau  with  sufficient  rounds  of  selection, 
indicating  that  the  selections  isolated  stabilized  populations  (Figure  4-11,  Panel  B). 
When  the  selection  time  is  short  relative  to  the  half  life  of  the  fastest  catalyst,  a 
population  of  only  the  fastest  catalysts  is  isolated.  This  short  selection  time  also  results  in 
a  plateau  below  95%  for  the  fraction  of  the  population  cleaved  at  the  end  of  the  round  of 
selection  (cleavage  at  time  t  =  St).  Because  there  is  no  significant  differential  enrichment 
for  catalysts  with  half-life  6-fold  slower  than  the  selection  time,  longer  selection  times 
equally  enrich  all  molecules  with  a  half-life  (rate)  approximately  6-fold  lower  than  the 
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selection  time.  This  produces  a  stabilized  population  that  includes  catalysts  with  half 
lives  (rates)  ranging  from  St/6  to  kfastest-  When  the  selection  time  is  very  short,  only  the 
fastest  molecules  in  the  population  have  a  high  probability  of  surviving  each  round; 
therefore,  multiple  rounds  of  selection  eliminate  all  but  the  fastest  molecules  (rather  than 
isolating  a  population  with  a  range  of  rates). 

Examining  the  Impact  of  the  Rate  of  the  Fastest  Catalysts  (kjh^) 

Another  simulation  was  performed  in  which  the  half-life  of  the  fastest  catalyst  in 
the  initial  population  was  examined  from  'Mife  =  6  minutes  to  60  and  246  minutes,  while 
holding  other  parameters  of  the  model  constant  (parameters  shown  in  Figure  4-12).  The 
number  of  catalysts  with  any  given  half-life  greater  than  the  selection  time  was  held 
constant  at  10"6  for  each  test  case,  while  the  distribution  of  catalysts  with  half  lives  less 
than  the  selection  time  varied.  The  population  of  catalysts  in  case  1  was  distributed 
according  to  a  linear  function  with  slope  of -1.2  x  10~6  (for  all  catalysts  with  half  lives 
between  St  and  kfastesA)-  The  population  of  catalysts  in  case  2  was  distributed  according 
to  the  proportionality  established  in  case  1  (for  all  catalysts  with  half  lives  between  St  and 
k/astes^),  while  holding  the  reactive  fraction  constant  (RF  =  10"6).  The  catalysts  for  case  2 
were  distributed  (for  rates  between  St  and  kfastes,2)  according  to  the  proportional  changes 
in  catalysts  in  case  1  and  holding  the  reactive  fraction  the  same  (10~6).  All  of  the  catalysts 
for  case  3  were  distributed  into  a  single  rate  class  with  rate  =  kfastest3  and  reactive  fraction 
equal  to  10"6. 

Figure  4-12  shows  that  reducing  the  rate  constant  of  the  fastest  catalyst  in  a 
population  (kfaslest  is  lower,  and  therefore  slower  catalyst)  has  a  similar  effect  as 


147 

increasing  the  selection  time  on  the  population  distribution  following  selection.  Because 
selections  enrich  for  molecules  with  half  lives  (rates)  6-fold  less  than  selection  time, 
when  the  kfastes,  is  reduced  to  within  6-fold  of  the  selection  time,  only  molecules  with  rate 
=  kfastest  survive.  This  indicates  that  given  sufficient  rounds  of  selection  as  to  attain  a 
stabilized  population,  the  distribution  of  catalysts  in  the  stabilized  population  is 
determined  in  part  by  a  combination  of  factors  including  both  the  fastest  catalysts  in  the 
initial  library  and  the  selection  time. 

Examining  the  Impact  of  the  Reactive  Fraction 

Simulations  were  performed  for  initial  populations  that  varied  in  either  the 
reactive  fraction,  the  slope  of  the  distribution  of  catalysts,  or  both  Figure  4-13).  The 
catalysts  in  case  1  and  case  2  were  both  distributed  according  to  the  same  linear  function 
(with  m  =  -4.5  x  10"9),  but  case  2  had  100  times  more  reactive  molecules  in  its  initial 
population  (RF  =    10"6  rather  than  10~8).  Not  only  did  the  selections  for  case  2  reveal 
catalysts  at  an  earlier  round  of  selection,  but  the  distribution  of  catalysts  in  the  stabilized 
populations  differed  from  the  distribution  for  case  1 . 

Both  stabilized  populations  for  case  1  and  2  have  a  decreasing  trend  (less  catalytic 
abundance  with  increasing  catalytic  rate)  as  in  the  initial  library,  but  case  1  decreased 
more  rapidly  than  case  2.  Reasoning  that  it  was  not  the  slope  of  the  initial  distribution 
function,  but  rather  the  ratio  between  faster  and  slower  molecules  established  by  the 
initial  distribution  function  and  the  reactive  fraction,  that  determined  the  distribution  of 
catalysts  in  the  stabilized  population,  a  third  case  was  tested.  This  third  case  differed 
from  case  1  in  both  the  number  of  reactive  catalysts  (RF  =  10"6vs  10"8)  and  in  the 
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distribution  of  those  catalysts  (m  =  -4.5  x  10"7  vs.  -4.5  x  10"9),  but  the  ratio 
("proportionality")  of  catalysts  with  rate  =  k„  to  catalysts  with  rate  =  k„+i  (where  n  equals 
any  given  rate)  was  held  constant  (for  a  linear  distribution,  the  "proportionality" 
relationship  is  mi  =  (mi*RF3)  / RFi). 

Case  3  reached  a  stabilized  population  earlier  than  case  1  on  account  of  a  100-fold 
greater  reactive  fraction,  but  once  stable,  both  populations  had  identical  distributions  of 
catalytic  rates.  The  reactive  fraction  therefore  alters  the  number  of  rounds  of  selection 
necessary  to  achieve  stabilized  populations,  but  the  distribution  of  catalysts  for  these 
stabilized  populations  does  not  depend  on  the  initial  reactive  fraction.  Instead,  the 
distribution  of  catalysts  for  a  stabilized  population  mirrors  the  proportional  relationship 
between  catalysts  in  the  initial  population  for  all  catalysts  with  rates  6-fold  faster  than  the 
selection  time. 

Comparing  Linearly  and  Exponentially  Distributed  Initial  Populations 

This  prompted  another  simulation  in  which  the  RF  in  the  intial  library  was 
distributed  according  to  a  decreasing  exponential  function  with  b  =  25  (Figure  4-14,  case 
3).  Catalysts  in  case  1  and  2  were  distributed  according  to  a  linear  function  (with  m  =  0 
and  -1.2  x  10"  ,  respectively)  as  modeled  in  earlier  simulation.     Although  case  1  and  case 
2  approach  population  stability  in  nearly  identical  manners,  the  difference  between  their 
population  distributions  can  be  detected  by  examining  cleavage  profile  at  early  times. 
The  distribution  of  catalysts  in  the  stabilized  population  differs  dramatically  for  each  test 
case.  In  each  case,  the  proportionality  between  catalysts  with  half  lives  less  than  St/6 
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mirrors  the  proportionality  in  the  initial  population.  This  relationship  can  be  seen  in 
Figure  4- 1 5  and  Table  4- 1 . 

The  approach  to  a  stabilized  population  (selection  plateau)  is  slower,  and  the  peak 
of  catalytic  distributions  for  selected  populations  is  shifted  closer  to  the  St,  for  the 
exponential  distribution.  Both  factors  indicate  that  the  population  is  still  enriching  for 
faster  molecules;  the  slow  enrichment  is  a  result  of  an  increased  distribution  of  catalysts 
with  half  lives  between  6*St  and  St/6  compared  to  catalysts  with  half  life  <St/6  in  the 
initial  distribution.  Although  the  stabilized  population  gives  little  information  about  the 
distribution  of  catalysts  with  half  lives  between  6*St  and  St/6,  the  rapidity  at  which 
increasing  rounds  of  selection  approaches  selection  plateau  may  be  used  as  an  indicator 
of  the  initial  distribution  of  catalysts  in  this  range. 

Deciphering  the  Distribution  Function  for  Catalysts 

Given  that  during  an  actual  in  vitro  selection,  the  only  data  that  can  (easily)  be 
obtained  experimentally  are  "fraction  cleaved"  vs.  "time"  for  the  entire  pool  at  each 
round  of  selection,  how  then  can  we  estimate  the  initial  distribution  of  catalytic  power  in 
library?  The  only  experimentally  measurable  variable  is  the  progress  curve  for  cleavage 
at  each  round  of  the  selection  process.  From  this  we  can  determine  the  progress  of  the 
entire  selection  by  plotting  the  amount  cleaved  at  the  completion  of  each  round  (t  =  St). 
When  this  selection-progress  plot  reaches  a  plateau,  we  know  we  have  isolated  a 
stabilized  population.  Further  rounds  of  selection  will  not  significantly  alter  the 
proportion  of  catalysts  in  this  stabilized  population,  and  of  equal  more  importance,  the 
proportionality  of  this  population  directly  mirrors  the  initial  library  (for  catalysts  with 
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half  lives  <  St/6).  If  the  selection-progress  plot  indicates  a  plateau  below  95%  cleavage, 
the  stabilized  population  contains  only  the  fastest  molecules.  How  do  we  use  the 
information  gained  from  these  simulations  to  estimate  the  population  distribution  for  a 
real  library  undergoing  selection? 

The  first  thing  we  must  consider  is  the  range  of  rates  we  are  interested  in 
estimating.  Unless  significant  activity  can  be  detected  in  only  a  few  rounds  (less  than 
three),  nearly  all  molecules  with  half  lives  greater  than  the  St  will  be  completely  lost  and 
have  little  effect  on  the  outcome  of  the  selection  procedure.  (Although  slow  catalysts 
may  be  completely  lost,  their  initial  abundance  will  compete  with  faster  catalysts  for  PCR 
amplification,  thus  diluting  the  relative  abundance  of  the  faster  catalysts  and  slightly 
delaying  their  detection.  Unless  these  catalysts  with  half  lives  close  to,  but  above  St,  are 
in  enormous  excess  over  catalysts  with  half  lives  just  below  St,  this  delay  is  unlikely  to 
make  any  difference.  Nonetheless,  the  "play  forward"  model  described  below  considers 
this  minor  effect). 

Choosing  selection  time  near  the  half  life  of  the  fastest  molecules  will  isolate  only 
the  fastest  molecules,  while  choosing  a  longer  selection  time  results  in  a  stabilized 
population  which  contains  a  broad  range  of  catalysts  with  rates  less  than  the  St. 
Practically  speaking,  multiple  selections  should  be  performed  with  various  selection 
times  to  determine  both  the  fastest  catalysts  in  the  population  and  to  isolate  a  range  of 
catalytic  rates. 

We  can  take  advantage  of  the  observation  that  there  is  no  selective  enrichment  for 
catalysts  with  half  lives  greater  than  selection  time,  and  therefore  the  distribution  of 
catalysts  with  half  lives  less  than  St/6  is  proportional  to  their  proportion  in  the  initial 
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library,  to  estimate  the  distribution  of  catalysts  in  the  initial  library.  The  proximity  of  the 
peak  for  catalytic  distribution  is  also  indicative  of  the  relative  proportion  of  catalysts  with 
half  lives  between  St  and  St/6.  This  method  relies  on  determining  the  population 
distribution  at  a  least  one  round  (preferably  a  round  shortly  following  population 
stabilization).  The  population  distribution  at  a  round  can  be  determined  experimentally 
by  sampling  individual  catalysts  (by  cloning)  and  individually  assessing  their  individual 
rates.  Sampling  approximately  100  individuals  should  give  a  reasonable  estimation  of 
the  entire  population  distribution. 

It  may  also  be  possible  to  estimate  population  distribution  using  statistical 
analysis  of  the  cleavage  progress  curve.  This  population  progress  curve  is  the  sum  of  the 
progress  curves  for  each  catalyst  in  the  population.  The  Prony  technique  is  a  statistical 
approach  to  deconvoluting  the  sum  of  many  exponential  curves  into  its  individual 
components.  This  approach  is  similar  (in  concept)  to  the  way  a  Fourier  transform 
deconvolutes  a  complex  signal  into  the  sum  of  many  sine  functions.  The  Prony  technique 
varies  both  the  exponential  rate  and  scaling  parameters  of  multiple  exponential  functions 
to  achieve  an  optimal  the  fit  to  the  experimental  data. 

Both  statistical  and  experimental  methods  can  be  used  to  estimate  the  population 
distribution  for  the  stabilized  population.  The  distribution  of  catalysts  in  the  initial 
population  is  directly  proportional  to  the  catalysts  in  the  stabilized  population  for  all 
catalysts  with  half  lives  less  than  ~St/6.  Although  both  techniques  will  estimate  the 
abundance  of  catalysts  with  half  lives  greater  than  St/6  in  the  stabilized  population,  this 
unfortunately  provides  no  information  about  the  distribution  of  these  catalysts  in  the 
initial  population.  The  irrelevance  of  catalysts  with  half-lives  greater  than  St/6  is 
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particularly  disheartening  to  the  person  who  individually  tested  the  rates  of  100  clones 
(particularly  if  the  majority  of  molecules  sample  have  rates  greater  than  St/6,  requiring 
that  more  individual  catalysts  be  sampled). 

An  alternative  technique  for  estimating  the  initial  population  distribution  would 
be  useful.  Rather  than  "working  backward"  from  the  stabilized  population,  the 
simulation  model  developed  here  can  be  "played  forward"  with  varying  starting 
parameters  to  optimize  fit  to  the  experimentally  observed  data  (most  notably,  the  round  at 
which  cleavage  is  first  detected,  the  percent  of  the  population  cleaved  at  selection 
plateau,  and  the  rate  at  which  the  selection  approaches  selection  plateau).  Only  two 
parameters  need  to  be  varied,  the  distribution  and  size  of  the  reactive  fraction  (leakage 
and  kfastest  can  be  determined  experimentally  and  therefore  do  not  need  to  be  estimated). 
This  method  offers  the  advantage  that  it  considers  the  impact  of  catalysts  with  half  lives 
greater  than  St/6,  and  uses  data  from  every  round  of  selection  (rather  than  just  the 
stabilized  population  of  a  single  round). 

Summary  of  Results 
Table  4-1  (next  page)  summarizes  the  major  results  of  the  experiments  and 
simulations  described  in  this  chapter. 
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Table  4- 1 .  Summary  of  Results  from  Chapter  4 


IVS  experiment 

•  Cleaved  products  were  less  than  1%  of  the  total  after  the  first  round  of  selection. 

•  The  amount  of  cleaved  products  in  selections  with  caged-ribose  increased 

steadily  with  successive  rounds  of  selection  to  -10%  by  round  7. 

•  The  amount  of  Selection  H  increased  dramatically  after  the  fourth  round  of 

selection. 

•  Eight  rounds  of  selections  for  samples  containing  the  caged-ribose  did  not 

significantly  enrich  for  catalysts.  Selections  with  caged-ribose  did  not  show 
time-dependent  cleavage,  and  when  pools  from  Round  1,  4,  and  8  were  re- 
synthesized  with  caged-ribose,  the  only  cleavage  occurred  immediately  after 
laser  photolysis. Pools  from  selections  with  caged-ribose  following  Round  8 
were  re-synthesized  with  unprotected  ribose,  but  did  not  show  any  time- 
dependent  cleavage. 

•  Selection  H  resulted  in  significant  enrichment  for  catalysts. 

•  Several  of  the  catalysts  isolated  from  Selection  H  were  nearly  identical  to 

catalysts  isolated  in  previous  selection  experiments,  but  single-nucleotide 
variants  possessed  rates  that  different  by  an  order  of  magnitude. 

•  The  bulk  pool  of  catalysts  from  Selection  H  is  Mg++  independent. 

IVS  Simulations 

•  The  leakage  parameter  dramatically  affects  the  number  of  rounds  or  selection 

necessary  to  achieve  enrichment  for  active  catalysts. 

•  The  relative  abundance  of  catalysts  compared  to  non-reactive  molecules  (the 

Reactive  Fraction)  influences  the  number  of  rounds  necessary  to  detect 
catalysts. 

•  The  selection  time  parameter  influences  the  number  of  rounds  of  selection 

required  to  first  detect  active  catalysts. 
•When  the  amount  of  cleavage  of  a  pool  at  the  end  of  each  round  of  selection 
reaches  a  plateau,  the  population  has  reached  a  stable  point  where  the 
distribution  of  catalytic  rates  remains  largely  unchanged  with  more  rounds  of 
selection 

•  If  the  selection  time  is  short  relative  to  the  half-life  of  the  fastest  catalysts  in  a 

library,  the  stabilized  population  will  reach  a  cleavage  plateau  below  -95% 
cleaved  at  the  end  of  successive  rounds  of  selection,  and  the  pool  will  be 
enriched  for  only  the  fastest  catalysts  of  the  initial  library. 

•  If  the  selection  time  is  large  relative  to  the  half-life  of  the  fastest  catalysts  in  a 

library,  the  stabilized  population  will  contain  a  distribution  of  catalytic  rates 
that  is  proportional  to  the  distribution  of  the  initial  library. 
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Figure  4-1  Cleavage  at  the  completion  of  each  round  of  selection  (time  varies  according  to  selection 
condition).  "Round  0"  refers  to  the  initial  library  before  any  rounds  of  selection.  All  selections  were 
exposed  to  laser  at  the  beginning  of  each  round  of  selection.  The  survivors  of  each  round  were 
gel-purified  and  amplified  for  the  next  roundof  selection.  The  amount  cleaved  for  samples  with 
caged-ribose  (A-F,  l-M)  increased  progressively  with  each  round.  Selections  G,  N.  and  O  never 
increased.  Selection  H  increased  dramatically  following  the  fourth  round  of  selection.  The  percent 
cleaved  at  the  completion  of  each  round  is  shown  below  each  sample.  Refer  to  Table  2-1  (page  34)  for  a 
a  description  of  each  selection  condition. 
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Figure  4-2.  Cleavage  assays  for  all  selections  conditions  after  8  rounds  of 
selection  reveal  that  only  selection  H  has  abundant  catalysts.    Pools  from  all 
selection  conditions  were  prepared  with  ribose-A  primer  (either  "BJ  +  ribose"  or  "ang 
+  ribose").  Cleavage  assays  were  initiated  by  adding  2X  reaction  buffer  and  the  slow 
cool  protocol  (rather  than  laser  initiation).  Only  selection  condition  H  had  any 
detectable  cleavage  following  49  hours.  Time  points  1-6  are  0,  2,  3,  1 1,  23.3,  and  48.7 
hours. 
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Figure  4-3.  Selection  H  shows  similar  cleavage  rates  with  ribose-  or  caged-ribose 
primer.  Pools  of  catalysts  were  generated  using  either  ribose-adenosine  or  caged- 
adenosine  from  the  survivors  of  Selection  H  following  round  1,4,  and  8.  Samples 
containing  the  caged-adenosine  were  initiated  with  laser. 
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Figure  4-4.  Kinetic  profiles  for  Selection  H  at  various  concentrations  following 
each  round  of  selection  shows  detectable  cleavage  following  Round  3  of  selection. 

DNA  from  the  survivors  of  each  round  of  Selection  H  was  re-amplified  from  stocks 
and  progress  curves  for  cleavage  assayed  at  3,  15,  and  60  nM. 
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Figure  4-5.  The  initial  rate  of  cleavage  for  Selection  H  following  rounds  4  -  8 
increases  slightly  with  increased  concentrations  of  DNA.  DNA  from  the  survivors 
of  each  round  of  Selection  H  was  re-amplified  from  stocks  and  progress  curves  for 
cleavage  assayed  at  3,  15,  and  60  nM. 
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Figure  4-6.  The  cleavage  rate  for  Selection  H  following  8  rounds  of  selection  is 
unchanged  by  incubating  at  15°C,  but  is  slightly  lower  at  temperatures  below 
15°C.  The  rate  drops  quickly  when  the  temperature  is  increased  above  25 °C.  (A)  Full 
time  course.  (B)  Plot  of  initial  rate  vs.  temperature. 
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Figure  4-7.  Selection  H  following  round  7  cleaves  r-primer  in  trans.  Selection  H 
(30  nM)  following  round  7  was  incubated  with  either  30  nM  or  270  nM  r-primer. 
Cleavage  of  both  Selection  H  DNAzymes  and  the  r-primer  substrate  was  determined. 
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Figure  4-8.  Cleavage  of  Selection  H  pool  DNAzymes  following  round  7  is  metal- 
independent.  (A)  Selection  H  cleavage  was  examined  in  1M  NaCI  and  50  mM 
HEPES  pH  7  plus  either  Zn++,  Mg++,  or  no  divalent  metal.  (B)  Selection  H  cleavage 
was  examined  in  50  mM  HEPES  buffer  pH  7,  plus  1  M  NaCI,  1  mM  MgC12,  1  uL 
DEPC  per  mL  buffer,  or  3  mM  EDTA. 
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CHAPTER  5 
SUMMARY 

Our  current  understanding  of  the  origins  of  life  suggests  that  catalytic  nucleic 
acids  must  arise  from  random  polymers  of  nucleic  acids.  Searching  random  libraries  of 
nucleic  acid  polymers  has  not  produced  the  abundance  and  power  of  catalysis  believed 
necessary  to  spawn  life.  Further,  the  addition  of  chemical  functionality  to  random 
libraries  has  failed  to  improve  the  catalytic  potential  of  random  libraries  anticipated. 

This  apparent  discrepancy  between  observation  and  expectation  may  be 
attributable  to  several  shortcomings  of  the  experimental  design.  Active  catalysts  may  be 
lost  during  the  preparation  of  libraries,  or  during  the  enrichment  for  catalysts. 
Furthermore,  only  incomplete  descriptions  have  been  attempt  for  any,  focusing  on  only 
the  fastest  catalysts  in  the  library,  rather  than  capturing  completely  the  distribution  of 
catalytic  power.  It  is  conceivable  that  added  functionality,  or  alterations  of  the  library  or 
selection  conditions,  can  dramatically  alter  the  abundance  and  distribution  of  catalysts 
without  altering  the  rate  of  the  fastest  catalyst. 

The  goal  of  this  thesis  is  to  develop  methods  for  estimating  the  distribution  of 
catalysts  within  a  random  library  of  DNA  sequence.  To  aid  in  the  development  of 
realistic  models,  we  began  with  an  in  depth  analysis  of  the  catalytic  behavior  of  an 
individual  DNAzyme,  614.  We  also  addressed  the  problem  of  losing  catalysts  during  the 
preparation  of  DNAzyme  pools  by  examining  the  utility  of  a  protected  ribose  (caged- 
ribose)  in  both  kinetic  analysis  and  an  IVS.  Simulations  of  IVS  were  also  produced  and 
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used  to  examine  the  impact  of  various  selection  parameters  on  the  outcome  of  IVS 
experiments. 

Catalytic  Behavior  of  DNAzvmes  from  IVS  is  More  Complex  then  Previously  Assumed 

Catalysts  isolated  from  a  random  library  using  the  Breaker- Joyce  protocol  are 
assumed  to  be  dependent  on  the  cofactor  used  in  the  selection.  Further,  cleavage  is 
assumed  to  follow  a  unimolecular  mechanism.  Analysis  of  614  revealed  that  both  of 
these  assumptions  are  violated. 

DNAzyme  614  was  generated  with  1  mM  MgCl2  in  the  reaction  buffer  but  is  in 
fact  active  without  MgCl2.  Geyer  and  Sen  directly  selected  for  metal-independent 
DNAzymes  using  the  Breaker/Joyce  IVS  procedure  without  metal  cofactors  in  the  buffer, 
demonstrating  that  divalent  cations  are  not  absolutely  required  (1997).  Examination  of 
both  the  hairpin  ribozyme  and  hepatitis  delta  virus  ribozyme  has  revealed  that  Mg++  plays 
a  only  passive  role  (likely  structural),  and  instead  implicate  an  internal  nucleotide  in 
general  acid-base  catalysis  (Young  et  al  1997;  Hampel  and  Cowan,  1997;  Nakano  et  al, 
2000;  Rupert  et  al,  2002;  Shih  and  Been,  2002).  It  is  likely  that  614  utilizes  an  internal 
nucleotide  in  a  similar  manner. 

It  was  also  observed  that  614  cleavage  is  concentration-dependent,  prompting  a 
study  of  its  bimolecular  cleavage  process.  This  revealed  that  614  can  cleave  a  number  of 
substrates  in  trans,  and  that  the  rate  of  trans-cleavage  at  saturation  is  6-fold  higher  than 
cis  cleavage.  The  balance  between  c/s-cleavage  and  /ra«s-cleavage  is  concentration- 
dependent.  The  cleavage  rate  of  614  acting  in  cis  and  trans  is  enhanced  by  lowering  the 
temperature,  likely  indicating  that  the  impact  of  lower  temperature  on  the  formation  of 
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the  reactive  folded  form  is  greater  than  the  impact  on  the  rate  constant  for  the  chemical 
step. 

The  fact  that  the  balance  between  uni-  and  bi-molecular  interaction  is 
concentration  dependent  may  explain  the  observation  by  Geyer  and  Sen  (1997)  and 
Breaker  and  Joyce  (1995)  that  cleavage  rates  decreased  when  DNAzymes  produced  with 
additional  cycles  of  PCR  (20  vs.  5)  were  examined.  Presumably  because  they  considered 
only  unimolecular  interactions,  these  researchers  did  not  report  the  concentrations  of 
DNAzymes  in  their  assays  (and  presumably  they  did  not  control  DNAzyme 
concentration).  We  predict,  therefore,  that  the  decrease  in  rate  with  increasing  cycles  of 
PCR  resulted  from  an  increased  concentration  of  their  DNAzymes.  If  kbi  <  kuni  for  their 
particular  DNAzymes,  higher  concentrations  would  favor  a  slower  rate  process. 

Although  divalent  cations  can  affect  the  folding  properties  of  nucleic  acids, 
cofactors  (such  as  Mg++)  and  functionalized  nucleotides  are  often  assumed  to  enhance  the 
rate  of  the  chemical  step.  The  observation  that  the  rate  limiting  step  for  614-catalysis 
includes  a  folding/association  step  may  explain  why  Mg++  is  not  required  for  catalysis.  It 
remains  curious  that  a  selection  process  would  generate  Mg-independent  catalysts  given 
the  availability  of  Mg++.  This  suggest  that  the  probability  of  find  a  better  catalyst  that 
uses  Mg++  is  less  than  the  probability  of  find  a  catalyst  that  does  not.  This  also  suggests 
that,  in  order  for  cofactors  (or  covalently  attached  functionalities)  to  improve  the  rate  of 
catalysis,  the  rate  limiting  step  must  not  be  the  folding  of  the  DNAzyme,  but  instead  the 
chemical  step.  The  impact  of  the  chemical  step  can  be  increased  relative  to  the  folding 
step  by  increasing  the  cofactor  concentration,  or  increasing  the  binding  strength  of  E»S  or 
E»cofactor. 
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Enzyme  folding  is  particularly  problematic  for  nucleic  acid  enzymes  because 
folding  requires  perpendicular  "face-to-face"  interactions  (more  constrained),  and 
alternative  conformations  are  relatively  common  (as  structure  predictions  for  614  reveal). 
Nucleic  acid-polymer  folding  (with  standard,  non-functionalize  nucleotides)  may  be  a 
slow  search  for  optimal  (active)  structures,  full  of  dead-end  (inactive)  traps.  This  is  in 
contrast  to  the  funnel-shaped  folding  landscape  predicted  for  proteins  (Ozkan,  2001) 
(recognizing  that  natural  proteins  may  have  evolved  into  regions  of  sequence  space  that 
have  this  property).  Functionalities  added  to  nucleic  acids  that  improve  folding  strength 
and  improve  search  properties  may  be  critical. 

Stanley  Miller  recently  proposed  that  prebiotic  synthesis  of  nucleic  acids  is 
enhanced  in  cold  environments,  and  postulated  this  may  be  evidence  for  cold  origins  of 
single-biopolymer  life-form  (Miyawaka,  2001).  To  our  knowledge,  our  research  is  the 
first  demonstration  of  better  catalysis  at  temperatures  lower  than  those  used  during  the 
selection.  In  a  prebiotic  world  in  which  nucleic  acid  folding  may  be  problematic,  a  cold 
environment  may  have  aided  synthesis  of  monomers  and  folding  of  polymers  in  a  nucleic 
acid-based  life. 

The  complex  nature  of  614  inter-  and  intra-molecular  cleavage  gives  us  insight 
into  how  a  single-biopolymer  hypercycle  may  have  originated.  The  earliest  stage  of  life 
requires  that  catalysts  (specifically  a  replicase)  act  on  various  different  substrates. 
Studies  of  the  impact  of  compartition  on  evolution  have  demonstrated  that  some 
mechanism  must  exist  to  keep  selfish  molecules  from  overwhelming  critical  functions 
(Matsuura,  2002).  One  method  of  achieving  compartition  relies  on  cellular  membranes. 
It  is  plausible,  however,  that  before  hypercycles  were  enclosed  in  membranes,  catalytic 
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nucleic  acids  had  to  rely  on  sequence  tags  to  distinguish  molecules  within  the  hypercycle 
from  selfish  (competitor)  molecules  outside  the  hypercycle.  Without  overt  selection,  614 
possessed  qualities  that  allow  for  this  type  of  distinct  regulation,  acting  differentially  in 
cis  and  trans  (even  for  closely  related  molecules)  at  various  concentrations.  These 
complex  patterns  of  regulation  are  possible  because  of  the  generality  of  base  pairing,  and 
thus  appears  to  be  intrinsic  and  abundant  to  nucleic  acids  (and  therefore  do  not  need  to  be 
"engineered").  The  complex  RNA-based  regulatory  mechanisms  now  appreciated  within 
modern  organisms  may  be  analogous  to  the  complex  regulation  achieved  by  the  single 
biopolymer  progenitor. 

Caged-ribose  is  Useful  for  Kinetics,  But  Increases  Leakage  Too  Much  for  IVS 
There  are  several  places  in  the  IVS  protocol  where  catalysts  may  be  lost  prior  to 
the  selection  step.  We  examined  the  utility  of  a  caged-ribose  to  guard  against  this.  The 
caged-ribose  proved  useful  in  protecting  the  sample  from  "premature  catalysis,"  yet  when 
deprotected  via  photolysis,  produced  kinetic  results  similar  to  those  generated  with 
unprotected  ribose.  Unfortunately,  the  laser  photolysis  generates  a  higher  amount  of 
background  cleavage  (leakage)  which  resulted  in  the  failure  to  enrich  for  active  catalysts 
when  used  during  an  IVS.  Simulations  of  IVS  revealed  that  even  slight  increases  in  this 
sequence-independent  cleavage  parameter  (analogous  to  other  forms  of  "leakage" 
inherent  in  the  column-selection  protocol)  significantly  delay  enrichment  of  catalysts. 
We  have  also  developed  alternatives  to  the  standard  Breaker- Joyce  method  for 
isolating  catalytic  strand  DNAzymes  and  for  performing  solid-phase  selections. 
Asymmetric  PCR  and  exonuclease  degradation  were  effective  methods  for  purifying  the 
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catalytic  strand  of  the  double  stranded  DNAzyme.  Both  methods  avoid  many  of  the 
experimental  weaknesses  of  the  Breaker-Joyce  protocol,  and  permit  selections  to  be 
perfomed  in  solution.  Even  without  the  benefit  of  a  protected  ribose,  these  improvements 
should  dramatically  improve  future  selections  by  minimizing  the  loss  of  catalysts  due  to 
column  leakage  and  NaOH-based  DNAzyme  duplex  separation. 

IVS  Geneates  Sequence  Variants  of  614  with  Wide  Range  of  Catalytic  Power 
Molecules  isolated  from  round  8  of  Selection  H  ("ang  +  ribose"  with  standard 

nucleotides)  selection  fall  into  several  groups: 

(a)  "Singletons,"  which  are  unlike  any  other  sequence.  Each  of  the  singletons  tested 
had  an  extremely  slow  rate  of  cleavage  (in  comparison  to  other  competing 
molecules).  The  fact  that  these  molecules  persist  through  8  rounds  of  selection 
despite  extremely  slow  rates  of  cleavage  suggests  that  they  may  be  surviving 
because  they  are  good  substrates  for  other  catalysts.  If  this  scenario  is  true,  they 
may  be  parasitic  (they  directly  compete  with  the  active  catalyst  for  survival),  or 
symbiotic  (they  enhance  the  self-cleavage  of  the  enzyme  that  also  cleaves  them). 
Selection  is  expected  to  continuously  minimize  the  population  of  parasitic 
substrates.    If  these  "singletons"  are  in  fact  symbiotic  molecules,  then  the 
survival  of  both  the  enzyme  and  substrate  follows  a  "kin-selection  model." 

(b)  Family  alpha  molecules  first  appear  after  Round  1  of  selection,  and  relatives  persist  in 
the  population  for  several  rounds  until  the  family  apparently  goes  extinct  after 
round  4.  One  representative  was  tested  and  it  had  a  rate  of  cleavage  near  zero, 
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suggesting  that,  like  the  singletons,  survival  may  have  been  based  on  being  a  good 
substrate  rather  than  a  good  catalyst, 
(c)  Family  beta,  gamma,  and  delta  are  all  very  similar  to  previously  selected  catalysts  (but 
nearly  all  differ  by  at  least  one  nucleotide),  suggesting  contamination,  a  non- 
random  library,  or  PCR  bias.  Each  of  these  possibilities  is  being  examined 
further. 

The  sequences  of  Family  beta,  gamma,  and  delta  all  show  a  great  deal  of  sequence 
similarity  within  each  Family,  as  well  as  between  each  Family.  Within  a  Family,  a 
change  of  a  few  nucleotides  was  shown  to  change  the  rate  up  or  down  by  an  order  of 
magnitude.  If  the  occurrence  of  similar  catalysts  from  two  independent  selections  arose 
from  contamination,  the  sequence  variation  within  the  families  from  the  second  selection 
arose  through  divergence,  suggesting  that  the  landscape  surrounding  the  original  parent 
strains  is  well  populated  with  active  sequences,  providing  a  relatively  smooth  landscape 
and  allowing  evolution  to  proceed  easily. 

If  the  occurrence  of  similar  catalysts  from  two  independent  selections  arose 
through  convergence,  this  implies  that  the  library  must  not  have  been  (much)  larger  than 
the  original  sampling  (10   ).  Within  an  order  of  magnitude,  we  know  of  at  least  10 
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active  catalysts  from  the  library,  implying  that  catalysts  are  present  no  less  than  10/10 
=  1010  (and  could  be  much  more  frequent  if  the  library  is  smaller  than  1012). 

Given  the  abundance  of  catalysts  differing  by  only  a  few  nucleotides,  methods  of 
estimating  catalytic  abundance  based  on  restriction  digestion  (Bartel,  1993)  will  grossly 
underestimate  the  true  catalytic  abundance  since  they  can  not  distinguish  between  these 
similar  catalysts. 
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Simulations  Allow  Estimation  of  Catalyst  Distribution 
The  impact  of  various  selection  parameters  and  library  parameters  was  examined 
with  simulations  of  the  IVS  procedure.  When  the  selection  time  (St)  is  very  short 
compared  to  the  half-life  of  the  fastest  catalysts  in  the  initial  library,  the  stabilized 
population  is  cleaved  less  than  95%  at  the  completion  of  each  round  of  selection, 
meaning  that  the  population  contains  only  the  fastest  catalysts.  This  scenario  describes 
the  selection  regime  of  most  in  vitro  selections,  and  thus  they  have  focused  only  on  the 
rate  of  the  fastest  molecules. 

If  functionality  were  to  dramatically  improve  the  abundance  of  catalysts  with 
rates  slower  than  kfastest,  then  these  previous  experiments  only  allow  a  comparison  of  the 
fastest  molecules.  The  origin  of  life  does  not  likely  depend  on  the  spontaneous 
generation  of  a  relatively  rare,  fast  catalyst,  but  rather  on  the  spontaneous  -  and  relatively 
frequent  -  generation  of  any  catalysts.  Thus,  a  full  description  of  the  distribution  of  fast 
and  slow  catalysts  in  a  random  library  is  needed  if  we  are  to  understand  how  to  improve 
the  catalytic  power  of  the  library  is.  The  simulation  reveals  a  method  for  doing  just  this: 
Performing  selections  with  a  range  of  selection  times  allows  estimation  of  both 
fast  and  slow  catalyst.  When  the  St~  kfastest,  only  the  fastest  molecules  are 
isolated.  Increasing  selection  time  such  that  St »  kfastest  creates  a  stabilized 
population  such  that  the  proportionality  between  fastest  and  slower  catalysts  after 
selection  remains  similar  to  the  initial  population.  This  allows  estimation  of  the 
initial  population  distribution  by  direct  examination  of  the  selected  population. 
•  The  population  of  catalysts  in  initial  library  can  be  described  sufficiently  with  two 

parameters,  the  Reactive  Fraction  (RF)  and  the  Distribution  Function.  The 
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leakage  parameter  and  kfastest  parameters  can  be  experimentally  determined.  The 
experimentally  measurable  outcome  of  selection  experiments  (progress  curve  for 
cleavage  at  each  round  of  selection)  can  be  fit  to  the  simulation  model  in  which 
the  RF  and  distribution  function  are  varied.  Simulations  have  revealed  that  model 
parameters  such  as  RF  and  population  distribution  have  a  significant  impact  on 
the  outcome  of  selection  experiments,  suggesting  that  relatively  small  range  of 
parameters  should  fit  the  experimental  data. 

Nucleic  Acid  Catalysis  and  the  Origins  of  Life 
Given  the  apparent  infrequency  of  nucleic  acid  enzymes  in  random  libraries,  is  it 
still  reasonable  to  assume  that  life  originated  with  nucleic  acids  as  the  basis  of  a  single- 
biopolymer  life-form?  Based  on  the  catalytic  power  of  proteins,  functionalized 
nucleotides  were  expected  to  dramatically  improve  both  power  and  frequency  of  nucleic 
acid  catalyst  in  random  libraries.  To  date,  this  approach  has  not  yet  bridged  the  gap 
between  what  is  generated  in  the  lab  and  what  is  expected  to  be  sufficient  for  the  genesis 
of  life.  Nonetheless,  abundant  evidence  argues  that  nucleic  acids  can  in  fact  catalyze 
complex  reactions,  as  they  currently  do  in  the  spliceosome  and  ribosome,  and  as 
reconstructions  of  the  RNA  world  predict  (Benner,  1989). 

Perhaps  nucleic  acids  are  not  just  limited  by  the  absence  of  the  chemical 
functionality  that  enhance  catalysis  (as  common  in  proteins),  but  also  by  the  precise  base- 
pairing  that  is  required  of  nucleic  acids  as  the  genetic  system.  The  standard  nucleic  acids 
are  the  product  of  stringent  selection  for  polymers  that  fold  well  in  a  linear  fashion  but  do 
not  fold  into  three  dimension  structures.  Catalytic  nucleic  acids  such  as  the  ribosome  and 
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spliceosome  often  contain  modified  nucleotides  which,  along  with  associated  proteins, 
can  improve  structural  arrangement.  In  the  future,  the  incorporation  of  functionalized 
nucleotides  that  enable  catalysis  and  folding  into  in  vitro  selections  may  dramatically 
improve  the  catalytic  potential  of  random  nucleic  acid  libraries. 
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